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(1) Introductory. In a recent investigation as to school children by Williams, 
Bell and Pearson*, it was shown that in a series of seven schools temperature 
was negatively correlated with weight for constant age, and further in six out 
of the seven school series negatively correlated with stature for constant age. The 
correlations were on the whole small, but definitely significant. In the great public 
schools the temperature appeared, allowing for age, to be lower than in the 
elementary schools. An association was thus suggested between intelligence and 
temperature, the more intelligent having the lower temperature. This result 
might well be considered as spurious, and due to differences of nurture, particularly 
to differences of nutrition. On the other hand various writers have asserted 
that low temperatures are associated with low intelligence, and it is usually stated 


* “A Statistical Study of Oral Temperatures in School Children,’ Drapers’ Company Research 
Memoirs. Studies in National Deterioration, No. 1v (Cambridge University Press). See pp. 57 and 59. 
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that the temperature of idiots is lower than that of normals. Such a conclusion 
might easily be reached if great care were not taken in keeping the mouths of the 
mentally defective closed. Norsworthy has given a descending scale of temperature 
from brightest girls to mediocre girls and so through mental defectives to idiots*. 
But the numbers dealt with were very inadequate, and the time of insertion very 
short. It appeared therefore well worth while considering further certain tempera- 
ture problems, and the Biometric Laboratory was lucky enough to have placed at 
its disposal observations by Dr Charles Goring on 500 convicts. Of these convicts 
400 were of normal intelligence and 100 were classed as weak-minded. In the bulk 
of cases the temperatures were taken twice at an interval of 14 days and upwards, 
the temperature of the room was also taken. Further the pulse and respiration 
were observed on both occasions. The examinations were all made at the same 
time, about a quarter of an hour after early dinner, and in the same condition,— 
the prisoner would be sitting in his cell, either reading, writing or doing nothing. 
When the thermometer had been inserted under his tongue, the medical officer 
would count the frequency of his pulse during one minute, and then would count 
respirations during one minute by observation of abdominal movement; the 
prisoner himself had of course no knowledge that this observation was being made. 
Besides the above data, the age, stature, weight, nature of the crime and previous 
convictions, the existence of special disease, the general health, the physical 
condition with regard to nutrition and muscularity, as well as the class of prison 
labour were all recorded. Only some of these data are discussed in the present 
paper. 

Of the characters dealt with, stature, age, weight, pulse, respiration and tem- 
perature are quantitatively determined, intelligence is in two classes, normal and 
feeble-minded; the other data are in broad categories. Thus General Health is 
classed as “Poor,” “Indifferent,” or ““Good”; for physical condition there are 
two divisions, each in alternate categories, (i) “Fat” or “Thin” and (ii) ‘“ Muscular” 
or “Weak”; labour done by the convict is classed into “Heavy” and “Light” ; 
and this forms an adequate classification by which to test health conditions and 
other physical categories. 


(2) Temperature. The temperatures were oral, taken with a Kew certified 
minute thermometer inserted under tongue with closed mouth for three minutes. 
None of the subjects were hospital invalids, all at the time of the observations were 
getting about, but of course a good many were suffering from chronic disease or 
disorders of one kind or another. The following classification of these diseases 
and disorders was made: 


I. Syphilis: disease apparently inactive at time of observation. 


II. Chronic Bronchitis or Asthma, or both: free from attack at time of 
observation. 


* “The Psychology of Mentally Defective Children,” Archives of Psychology, November, 1906. 
See also Pearson: Mendelism and Mental Defect, No. ut (Cambridge University Press), p. 27. 
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III. Organic Heart Disease. 

IV. Chronic Tubercular Disease: quiescent at time of observation. 
V. Cardio-vascular degenerative changes. 

VI. Epilepsy. 


VII. Functional Heart Disorders without organic disease, and described as 
“weak heart.” 


VIII. Kidney degenerative changes: indicated by albuminuria, etc. 


IX. Other pathological states: Necrosis of bone of old standing; diabetes, 
active at time of observation; tabes dorsalis (active); general 
paralysis of the insane (active); chronic rheumatism (active); 
urethral stricture, resulting from gonorrhoea; previously insane; 
goitre (active); cirrhosis of liver (active); hemiplegia, resulting 
from a stroke. 


927 temperatures were taken on 500 individuals, 427 being repeated tempera- 
tures, the intervals between observations varying from 2-3 weeks to 2-3 months. 
326 temperatures were those of criminals suffering from the diseases and disorders 
classed as above, the remaining 601 temperatures may be taken as those of fairly 
normal individuals, differentiated only from the same class of the population by 
the fact of their criminality and by the action of prison diet, discipline and labour. 
Out of the 400 mentally normal convicts 34:5 °/, were suffering from one or another 
disorder, and of the 100 weak-minded convicts, 27°. This is explained by the 
fact that Parkhurst, where the observations were made, is largely a prison to which 
the physically or mentally feeble are sent, and accordingly it is more than probable 
that the weak-minded in general suffer at least as much from disorder and disease 
as the normal-minded criminal. Taking apparently inactive syphilis alone we 
find 12-75 % of the normal-minded and 11-00 % of the weak-minded* suffering 
from non-active disease, a nearer ratio of equality. It would be of great value 
if a Wassermann test could be made throughout a large convict establishment. 
It seems probable that at least 10 °/, of the inmates would be found to be syphilitic. 
Making due allowance for the class from which the bulk of criminals are drawn, 
this does not suggest that much more than 5 %-6 % of the non-criminal population 
are likely to be affected, and compares on the whole favourably with the high 
estimates published of the syphilised population in Germany}. The following 
table sets forth our results for mean temperatures: 


* One case of general paralysis of the insane included, and in both classes the probably syphilitic 
also counted. : 

} Fritz Lenz: ‘Ueber die Verbreitung der Lues...,” Archiv fiir Rassen- und Gesellschafts- 
biologie, Bd. vit. S. 306-327, 1910. 
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TABLE A. Mean Temperatures of Various Classes. 


Class No. of Temperatures Mean Temperatures 

| 
| ‘Weak Heart” (VII)... ae aa 20 98-585 -+ -068 
| Weak-minded = ae nee Sed 191 98-547 -- -022 
Kpilepsy (VI) ze ee ae oe 27 98-537 -- .059 
Cardio-vascular changes (V) me dae 36 98-408 -+ -050 
Normally Healthy Criminals... Ha 601 98-383 + -013 
Whole Population ae fae 927 98-373 + -O11 
Chronic Bronchitis. Asthma (II) | 51 98-353 + -043 
Kidney changes (VIII)... oh age | 14 98-350 + -082 
Syphilis (I) a the =e 97 98-342 + -031 
Organic Heart Disease (IIT) Re 6 32 98-278 + -054 
| Chronic Tubercular Disease (IV) an 22 98-241 + -065 
| Other pathological conditions (IX)... _| 27 98-170 + -059 


Now it must be at once admitted that many of the series here are so small, 
that we cannot say that the temperature differences, still less the general tempera- 
ture order, are significant, but certain differences do appear significant. For 
example, the tents minded do appear to have a significantly higher ayereee 
temperature than normally healthy criminals. The difference, 


weak-minded — normally healthy = -164 + -026, 


is Clearly significant numerically, but this does not show that the higher temperature 
is inherent in the weak-mindedness. We have already seen that of the weak- 
minded 27 9% were suffering from one form or another of disorder, while we are 
comparing them against a normally healthy criminal population. The only groups, 
however, that could tend to raise the temperature of the weak-minded are V, 
VI and VII. To these groups the weak-minded contribute 


V.  Cardio-vascular degenerative changes: 4 temperatures. 
VI. _ Epilepsy: 16 a 
VII. “Weak Heart”: 0 


9? 


These 20 temperatures have therefore been removed and we find: 
Mean temperature of weak-minded = 98-541 + -021. 


Thus the removal of the epileptic has only lowered very slightly the temperature 
of the weak-minded group. That has indeed 8-4 % of the temperatures of the 
epileptic as against only 1-2 °% in the group of normal-minded criminals*, but 
the difference of temperature is only in the very smallest degree due to this excess 
of epilepsy. One point, however, of some apparent importance had it been based 
on more numerous data was observed, namely, that the 18 temperatures of the 
weak-minded who were suffering from syphilis, apparently inactive at the time 
of observation, gave a high mean, i.e. 98-683, as against the fact that the general 


* Reckoned on individuals, not on temperatures observed, the percentages are 9:0°% and 1:5 % 
respectively. 
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syphilised population had if anything a rather subnormal temperature; but the 
removal of these 18 temperatures gave for the weak-minded population a mean- 
temperature of 98-533 + -021, which although slightly reduced is still significantly 
greater than that of the normal criminal population. I think we must conclude 
that none of the observed pathological states is the source of the higher temperature 
of the weak-minded, but whatever that source is, it is intensified in the presence 
of either epilepsy or syphilis*. 
f In the paper by Williams, Bell and Pearson, already cited, it is shown that 
body temperature falls with age up to the adult stage. From the present data 
we see that from age 22:5 to age 85:5 the fall is continued. Table XXXIV gives 


Dracram I. Regression Line of Temperature on Age for Criminals. 
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the data and Diagram I shows the plotted regression line. If 7 be the temperature 
in degrees Fahrenheit, A the age in years, the equation to the regression line is: 
T = 98-5915 — -004989 A, 
which indicates that the temperature falls -005 or gio of a degree per year. The 
problem then arises—can the difference of temperature between the weak-minded 
* Weak-minded epileptics, 98-669; criminal epileptics in general, 98-537; weak-minded syphilitics, 
98-683; criminal syphilitics in general, 98-342. These results are open to the interpretation that the 


weak-mindedness may be owing to cerebral injury due to the disease, and that cerebral disorders are 
more apt than other forms of these diseases to be accompanied by higher temperatures, 
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and normal criminal be solely an age difference? An examination of Table B 
shows that the mean age of the weak-minded is 38-16 years and of the normal- 
minded 44-86 years, or the former are younger by 6-70 years. This would correspond 
only to an average greater temperature of -034 degrees as against the observed 
difference of -164 degrees. The temperature difference of weak- and normal-minded 
is therefore not due to age difference. This is confirmed in Diagram II, where 


DriacraM II. Regression Lines of Temperature on Age for Feeble- and Normal-Minded. 
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the observations and the calculated regression lines for the weak-minded and the 
normal-minded groups* are plotted alongside each other. Notwithstanding the 
irregularities due to the smallness of the sample, it will be seen that at every age 
the mentally defective have a higher temperature than the normal-minded, and 
further that, as averaged out by the regression lines, the difference at each age is 
sensibly constant (the regression lines are practically parallel) and equal to 0°-2. 


We conclude that weak-minded adults have a higher temperature than normal- 


minded adults and in this respect differ from mentally defective children who are 


* The equation to the regression line of the weak-minded group is 7’=98-6535 —-002796.4, and 
of the normal-minded group 7 =98-5063 —-003939A. 
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stated by Professor Norsworthy* to have a lower temperature than normal- 
minded children, or else Norsworthy’s material by including idiots was differ- 
entiated from weak-minded criminals, or again was unreliable owing to the brief 
period of insertion of the thermometer, and possibly to insufficient care in keeping 
the mouths of the mentally defective children closed during the observation. 
It is not possible in our case to account for this temperature difference by either 
(i) difference of age} or (ii) difference of observed pathological states other than 
those summed up in the word weak-minded. 


One of the noteworthy points of Table A is that when we pass from one or other 
type of heart trouble (V or VII) and from the epileptic and weak-minded groups 
to chronic and organic disorders and diseases of a degree of activity which does 
not involve hospital segregation, we find, without laying too great stress on the 
individual order, that they tend on the whole to subnormal temperatures. 


(8) Further Differentiated Characters of the Weak-minded and Normal-minded 
Populations. 


A study of Table B shows that the weak-minded criminal is differentiated 
from the normal-minded¢ in a number of other characters beyond body temperature. 


TABLE B. 


Table of Means, Standard Deviations and Coefficients of Correlation of the Chief Variates dealt with. 


F Mean Standard Deviation Coefficient of Variation 
Variate Unit _ | = 1 ae a 
Whole | Weak- Whole | Weak- Whole Weak- 
Population | minded Population minded Population minded 
Age Years 43-518 +:438 38:160+-717 | 14:510+°309 | 10-627--507 | 33-34-+-79 27:8541°-43 
Temperature | Degs. Fahr.. 98-373-4-011 98-547 + -022 -486 + -0088 -454+-016 -49-+--O] 46+ -02 
Pulse Per minute | 74:215 +-245 | 77:620-L:578 11-062 Seyi} 11-852+.-409 14-89 + +24 15:27 ate D4 
Respiration Per minute | 18-985+-075 | 19-679-+--168 3:380+:053 3:4383+:-119 | 17-80---29 17-45+ -62 
Height Inches 65-281 ---082 64-788 + :185 2-682 +:058 2:706+:131 4-11+-09 4:18+ -20 
Weight | Lbs. | 142-147+.-516 | 134:397+.-951 | 16-996-+-365 | 13-881-+:672 | 11-96+-26 10:338+ -51 
| 


* Joc. cit. on our p. 2. 

+ This is confirmed by the correlation —-229-+-[-030] for mentality and temperature for constant 
age. See Table C. 

~ The term “normal-mirded” is throughout this paper used of that portion of the criminal population 
not classed as weak-minded, and is not intended to serve as the basis of any opinion as to whether the 
non-weak-minded criminal is or is not mentally differentiated from the general non criminal normal- 
minded population. 

§ It will be noticed that the value of this standard deviation is in very close agreement with those 
found for first and second temperatures in the case of children by Williams, Bell and Pearson. See 
loc. cit. p. 10, where o,, =-5343 and o;,, =-4803. 
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For if he significantly exceeds or falls short of, on the average, the whole population, 
he will a fortiori be differentiated from the normal-minded population*. We have 
the following results: 

Temperature -174+ -025. 


Difference of Means: Pulse 3°205 + +628. 
(Weak-minded—whole population) {Respiration | -694+ -184. 
Height — -493+4 -202. 

Weight — 7-750+ 1-305. 


We conclude from these results that while the weak-minded criminal probably 
but not very certainly has a somewhat less staturet, he has quite markedly less 
weight, and this is combined with a higher temperature, a quicker pulse and 
a quicker respiration. 


We now come to precisely the same point as we reached in the problem of 
differential temperatures. We ask: How far are these differences explicable on 
the basis of the recognised difference in age between the weak-minded and the 
total population? From Table C (p. 11) we extract the following correlations 
and we give the corresponding regressions: 


A a tt 


Regression of Second Variate on Age 
Variates Correlations jee 
Per Year Per 5:36 Years Compan 
Age and Temperature — +150 + -022 — -005,026 — 0269 +174 
Age and Pulse + +121 + -022 + :092,400 + -4951 3°205 
Age and Respiration + -077 + -022 + -017,887 + -0958 694 
Age and Height + :023 + -030 + :004,177 + +0224 — -493 
Age and Weight + +136 + -030 + :159,532 + -8548 — 7:750 


* Let the means, total frequencies and standard deviations of the general, the normal-minded and 
the weak-minded population be g, x and w, fa, fins fws Tys Tr aNd gy, and let 8 be the multiple (say, 
about 3) which a quantity must be of its probable error to be considered significant. Then in the 
usual way —t is significant if 


=_Gis— : 9" i ow _ fy (674498) fo 
J-Wissor>p (67149) / Fe +" (1 an [r+ Sa (FoF) 


Now with 8=3 and our numbers, roughly f,, =190, alas we have 
(674498) fn 
14+." =1 0005, 
we to (fo a) a) 


and this expression may be therefore taken as unity. Hence if we compare 


=a - Cy" 
g—w with °67449 oo" a Sao 
fo iy 


and it is significant. it will certainly be significant if compared with 


2f 
67449 we oo 4 fu (.- oe 
al fo 


which is a smaller quantity. See Biometrika, Vol. v. pp. 181-183. 
+ Worked out from the fuller formula the result is —-493-+:143, or the difference is greater than 
three times its probable error and accordingly may be considered probably significant. 


sr 
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Now it will be seen that while all these correlations are significant except age 
and height*, none of them are of any considerable magnitude, or capable of 
producing any appreciable influence on the observed comparable differences. 
Further since pulse and respiration become very slightly quicker with age, they tend 
to emphasise and not to reduce the observed differences between these characters in 
weak-minded and normal-minded. Age might account for one-ninth of the observed 
difference in weight, or roughly for less than 1 lb. out of the nearly 8 lbs. difference 
observed; and, as before, age accounts for a very inconsiderable portion of the 
observed temperature difference. We are thus justified in concluding that the 
observed physical and physiological differences are quite independent of age 
differencest. We are able to confirm these results by another method of 
investigation, which well illustrates the insignificance of the age correction. In 
our opinion ‘“ weak-mindedness” must depend largely on the personal equation 
of the recorder, and there is no definite bounding line between the weak- and 
the normal-minded. In other words we may consider mental state as an approxi- 
mately continuous variate, and one not widely divergent from the Gaussian in 
frequency distribution. Now it is known that in school-children the feeble-minded 
do not amount to 2°%§; and it is probable that in the community at large their 
number does not even reach this. On the other hand the criminal class is largely 
recruited from the less-intelligent section of the general population. Among 
inebriate and prostitute women probably 40 % to 60 % may be classed as feeble- 
minded||, and though among male criminals such a high percentage is not reached, 
they must bulk very considerably. Goring in his invaluable Report{] considers 
the percentage to be 10 to 15. In our present data we have 100 weak-minded 
individuals and 400 normal-minded, or we have taken 20 °% and calculated the 
correlations between intelligence and the physical and physiological characters 
on this basis. But it is of interest to observe what order of change would be made 
in our results by taking a lesser percentage. Consider, for example, Table XX XV 
for intelligence and temperature. Here the temperatures of the weak-minded 


* Of course the correlation of age and height is largely misleading, the younger criminals are still 
growing, the older are slowly losing height. Thus the regression is really skew. For example, the 
value of the correlation ratio 7 for height on age is -147. If, however, we take the correlation of age 
and height for criminals of over 26 years, at which age other investigations (see Biometrika, Vol. 1. p. 47) 
show that height begins to diminish, the negative correlation is only —-01195 corresponding to merely 
a loss of -0022 inches per year of past prime life, and therefore quite negligible. 

+ Except in the matter of age and weight Table B shows us that judged either by absolute or relative 
variability there is no practical difference in variability between weak-minded and normal-minded 
criminals. 

{ On the continuity of Weak-mindedness, see Questions of the Day and the Fray, No. vim. “The 
Continuity of Mental Defect,” by Karl Pearson and Gustav A. Jaederholm; No. rx. “On the 
Graduated Character of Mental Defect,” by Karl Pearson (Cambridge University Press). 

§ See A Preliminary Study of Alcoholism in Adults, by Amy Barrington and Karl Pearson (Cambridge 
University Press), p. 9. 

|| A Second Study of Extreme Alcokolism in Adults, by David Heron (Cambridge University Press), 
p. 15. 

§] See The English Convict, a Statistical Study (Wyman and Sons), p. 254. 
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form 20:6 % of the 927 recorded temperatures; we will raise the normal-minded 
temperatures to 1719 by multiplying that row by the factor 2-336 retaining, however, 


only unit frequencies. We find approximately : 


Table XXXV, modified to give 10% weak-minded only. 


Temperature. 
Sl ae nl Sn leces LS XR x S oo > St ESS com eS ; 
e/elele|/Sle]}s |e 12 |S | S| = 1s |S le eee 
af > | SS > S S > > | > S > > > > > > > > ~ 
Intelligence JT | Titi Till tlh | i |} Fad ek] od 0 eee 
Sr te eran eta fe ara Ca tb 9 is S be 2 i) SS = Waele soe] ates OSS ll GS 
< a) oS aw | i ~~ i | ra re) io) eo | % co 1a 1s > i x 
| l | | | i 
Weak-minded } — | — |—j|—| 1 4 10 12 24 25 34 31 Za Ns 8 2 ee 191 
Normal-minded] 2 5 4 7 | 49 | 70 | 145 | 201 | 236 | 299 | 248 | 236 | 114 | 61 | 23 | 12 5 2 1719 
Totals 2, 5 4 WN) BOON 7455? 2213) 2260 e324 e28 2512267 | 138) | 776s roils: | 6 2 1910 


Here the correlation r (found by the bi-serial method) is for this 10% of the 
weak-minded 
r = — :231 + -030, 
whereas with the 20 % value of the weak-minded 
r = — :258 + -030. 
We conclude from this result that even halving the percentage of the feeble- 
minded will only alter the correlation by less than its probable error, and that 


accordingly for the purpose we have in hand the bi-serial values of the correlations 
based on 20% of weak-mindedness will give sufficiently accurate results. 


The reader is now asked to examine the last six lines of Table C. He will 
observe that, while nowhere very important, all the correlations of mentality with 
age, temperature, pulse, respiration, height and weight are significant; that the most 
important is weight, and that the signs are in agreement with the conclusions 
already reached, i.e. those with less mentality, or the feeble-minded, have less 
weight, height and age*, but greater temperature and quicker pulse and respiration. 
Examining the second column, which gives for the same variates the partial correla- 
tions for constant age, we see that no very substantial changes are made in the 
crude values; they all remain still significant, and this could have been anticipated 
by merely noting the small age and variate total correlations at the top of the 
first column. We conclude again that the physical and physiological characters 
dealt with are significantly related to feeble-mindedness, but the relationships are 
of such a low degree that they would be of absolutely no service for the purpose of 


* The extent to which this lesser age of the weak-minded criminal is due (a) to earlier average 
conviction, resulting from earlier crime or easier capture, (b) to more frequent transfer to other 
institutions, or (c) to earlier death is of course not determinable from the present data. But it has to 
be borne in mind that fraudulent offenders convicted late in life belong to a different social class, and 
are rarely feeble-minded. This class has usually greater stature and weight. 
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TABLE C. Physiological, Physical and Mental Correlations. 
Partial Correlations 
: Total 
Variates z : Age and] ,_ Weight 
ponzciated hea Age eee Pulse Bee: Height Weight Respite: ee peeane and. 
Constant Constant Constant Constant Constant| Constant ee Constant Constant! ¢ ce 
Age and Tem- | —-150 “195 —-142 --151 —-158 | -160 
perature +-022 021 030) -+:021/ .+-021 | +021 
Age and Pulse | +-121 = +-174. — ech ly +:124 |+-107 +097 
+022 + -022 +-022]; +:022| +-022 +022 
Age and Respira-| + -077 _ = = — +-081 | +-090 L083 
tion +022 +022) +:022 +022 | 
Age and Height | +-023 = = = _ = ~-070 = — 7 = 
+ -030 +-030 
Age and Weight | +-136 = = — — |+-1651 = = = — — 
4-030 4-030 
Temperature and| +-288 |+-312 — — +-283 |+:289 |+:286 |+-305 |+-308 |+-314 | +-286 
Pulse +:020) +-020 +-020} +:020} +-020; +-020} +-020) +-020) +-020 
Temperature and|+-142 | +-156 — +-130 —_ +:144 |+-146 — +:163) | +:158 | +-144 
Respiration +:-022) +-022 +:-022 +:022 |) +:022 +:022) +-022) +-022 | 
Temperature and|+-003 | +-007 — — 027 — — +039 — 
Height +-022| ---022 +:022 +022 
Temperature and|+-043 |+-065 +051] — — — +:075 — 
Weight +:022) +-022 +022 +:022 
Pulse and Res- |+-060 |+-070 | +-020 = == +049 +:071 — +-081 +:059 | +:052 
piration ---022 + -022 -E De +-022 + +022 +-022 claw 22 +-022 
Pulse and Height|-—-078 | -—-081 — _ = aS =a ki8 SF Srl = = 
+:022 + -022 +:022 -+-022 
Pulse and Weight | +-114 +-099 = — = +:197 — == = +181 — 
+ -022 +-022 +-021 L +022 
Respiration and | —:144 | —-146 — = — — --113 — -—-108 ~- — 
Height +:022 | +-022 +:022 +:022 
Respiration and | -—-089 | --101 — — — +007 — — — +020 — 
Weight +:-022) +-022 +-022 +-022 
Height and +:580 | +:583 — — — = aa — — _ — 
Weight +:020} +-020 
& : Ie 
Mentality and | +-264 — — — — +:263 | +234 — -—— -- +-230 
ge +-041 +[-041]} +[-041] +[-041] 
Mentality and |--258 | —-229 — = — —-262 | —-288 — —:262 | --233 | —-289 
Temperature +-020 [-030] +[-0380], +[-030] +[-030] | --[-080]} +-[-030] 
Mentality and | --223 | -—-266 — -214 | —-276 — — 313 -264 | —-289 
Pulse +:030 | +[-030] +[-030] | +[-030] +[-029] | +[-030] | +[-030] 
Mentality and |--148 | --175 — oon — --129 | --127 — --153 | --160 | --133 
Respiration -031 | +[-031] +[-031] | +[-031] | +[-031] | +[-031]) +[-081] 
Mentality and |+-:154 |+-153 — - — -044 — —-031 — — 
Height +:043 | +[-043] +[-044] +[-044] 
Mentality and |+-324 /|4+-304 [4-347 |+-361 [4-315 |+-291 -—— — — +274 -- 
Weight +-040 | +[-040]} --[-040] | +[-040]] -_[-040]) +-[-041] +[-041] 


The partial correlations were all calculated from values of the total correlations to four figures. 


The probable 


errors of the hi-serial correlations were calculated by Soper’s formula (see Biometrika, Vol. x. p. 384). Those of the 
partial coefficients involving bi-serial values of total coefficients were calculated on the assumption that the partial 
coefficients had been calculated directly from bi-serial tables and are placed in squared brackets to indicate that they 


are only approximative. 
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diagnosis, or for discriminating the weak-minded from the normal-minded. An 
examination of the Tables XX XVII, XXXVIII and XLII, XLIII shows that the 
range of the normal in every case is practically coextensive with, or more extensive 
than, that of the feeble-minded. This is in good accordance with the results already 
reached by Jaederholm and Pearson from other data: see Questions of the Day and 
Fray, vir. “The Continuity of Mental Defect,” and tx. “On the Graduated 
Character of Mental Defect.” With regard to the further partial correlations, it 
will be seen that the relationships between the physiological variates (temperature, 
pulse and respiration) and mentality are not much modified by making the physical 
variates constant. The relationship between mentality and temperature reaches its 
maximum for constant weight and height, but the gain is only about its probable 
error; that for mentality and. pulse is more considerable being three times the 
probable error for constant age and weight; while that for mentality and respiration 
is a maximum for constant age and height, but the increase is less than half the 
probable error. Finally it may be noted that the relationship of mentality to 
weight is somewhat intensified if the temperature and pulse rates be made 
constant. 

In no case, however, do we reach correlations of the slightest importance for 
diagnostic purposes. 


(4) On the Interphysiological Correlations. 

The next point to which we must turn our attention is the relationship between 
temperature, pulse and respiration. We have the following series of correlations 
from Table C. 


Partial Partial 

Garclnteds Pine Crude | Correlations for | Correlations for 

2 Correlations | Constant Age Constant Age 

| and Weight and Height 

| 

Temperature (7’) and Pulse (P) +:288-+-020 +308 -+-020 +:314+-020 
Temperature (Z’) and Respiration (2) +:142+--022 +:163+-022 +:158-+ 022 
Pulse (P) and Respiration (R) +:060+ :022 | +:081+-022 +:059-+ -022 


All these correlations are significant, but the sole one of any importance, and 
even that is of little value, is the relation between temperature and pulse. The only 
comparative material is that of Kérésy reduced by Julia Bell (Biometrika, Vol. vit. 
p. 232). She found for pulse and respiration + -108+ -042, in excellent agreement 
with our present value. Her values for pulse and height, — -072 + -042, and for 
respiration and height, — -042 + -042, are also in accordance with ours considering 
their probable errors. The smallness of the relation between pulse and respiration is 
noteworthy, and none of these relations are substantially modified if we make age 
and weight or age and height constant. In fact temperature, as an examination 
of Table C shows, has no relation to either height or weight, while greater stature 
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slightly lessens, greater weight slightly increases the pulse rate. On the other hand 
the “ bantams”’—judged by both stature and weight—have a somewhat quicker 
respiration. But for all practical purposes height and weight may be neglected 
in considering these physiological variates. It may be of interest to table the 
actual association of an increase of any one of these variates on the other two. 


Probable temperature 7, for a given pulse P: T, = 97-4336 + -01266P. 
Probable temperature 7”, for a given respiration R: T’, = 97-9860 + -02040R. 
Probable pulse P, for a given temperature 7: P, = 6-5551 T — 570-6300*. 
Probable pulse P’, for a given respiration R: P’, = 70-4986 + -19575 R. 
Probable respiration Ff, for a given temperature 7’: R, = -98563T — 77-9747. 
Probable respiration R’, for a given pulse P: R’,, = 17-6291 + -01827P. 


Diagram III. Regression Line of Temperature on Pulse for Criminals. 
g I 


Temperature 


Pulse 


In other words the temperature increases by -127° for every increase of 10 
beats per minute of the pulse and by -204° for every increase of 10 respirations per 
minute; the pulse-rate quickens 6-56 beats per minute for each degree of tempera- 
ture and 1-96 beats per minute for every increase of 10 respirations per minute. 
Lastly the respiration-rate quickens -986 respirations per minute for each degree 


* For the normal-minded criminals only this line is P, =6-41807' —557-7481. 
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of temperature, and -183 respirations per minute for every 10 beats per minute 
increase of the pulse. 

A further interesting point is that, if we could possibly suppose these relations 
for criminals, whose disorders, if any, do not enforce hospital treatment, to extend 
beyond the range of values observed, then the pulse would cease at a body 
temperature 7’,, = 87°-05* and the respiration would cease at a body temperature 
T",,= 79°11. We have no right, however, to extend our linear regressions to 


Diagram IV. Regression Lines of Temperature on Pulse for Feeble- and Normal-Minded. 
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these extreme cases; all we can say is that they suggest cessation of pulse before 
that of respiration, and raise the problem of whether the temperature at death, 
T,,, is or is not on the average as much below normal temperature as is indicated 
by the above vague figures. 

In Diagram ITI the regression line of temperature on pulse is drawn to test the 
general accuracy of linearity in describing this relationship. It will be seen that 
graphically the accordance is good, the group 45-48 of pulse-rate (see Table XX XTX) 
has only two entries and the deviation is non-significant. Algebraically for 


* Deduced from normal-minded criminals only 7’, =86°-90. 
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temperature on pulse 7 = +2951 (without corrections) to be compared with 
ry = +2865 (without corrections) = -2881 after correction. 


In Diagram IV the weak-minded have been separated from the normal-minded 
criminals and the separate regression lines of temperature on pulse are given for 
both classes. The equations to these lines are: 


For normal-minded criminals: 7, = 97-3651 + -01315P, 

For weak-minded criminals: 7, = 97-9976 + -00708P, ‘ 
and the correlations are respectively -2906 + -0228 and -1847+ -0471; thus we see 
that there is less correlation in the case of the weak-minded and further that the 
regression lines are no longer, as in the case of temperature on age, parallel. While 
the weak-minded of low pulse-rate have a higher temperature than the corre- 
sponding group of normal-minded, the weak-minded with high pulse-rate tend to 
have equal or perhaps slightly inferior temperature to the corresponding group of 
normal-minded. 


(5) On the Correlations between the Physiological and the Physical Variates. We 
have already seen that various points of the physique of these criminals were 
noted by the medical officer. In the first place their capacity for hard labour; 
actually the class of work on which each criminal was employed was given, but 
it seemed sufficient for our present purposes to divide these into hard and light 
labour. In the next place we have the appreciation of muscularity given by the 
two categories muscular and weak. Physical condition was also marked by the 
classification fat and thin, corresponding to good and bad state of nutrition. 
Lastly the general health as appreciated medically was classed as good, indifferent 
and poor. We made the division Good and Not-good, and used in all cases four- 
fold tables, where the bi-serial 7 was not obtainable. 


The following correlations of the physical characters were reached : 


TABLE D. 


Correlations of Variates measuring Physique. 


General Health |CapacityforLabour| Muscularity NUtOn 
(Good and (Set to Hard or (Muscular or | Tat or Thin) 
Not-good) Light Work) Weak) fae 
| 

General Health sce I = +:770+:027 | +:639+-038 | +-451-+-043 
Capacity for labour +:770+-027 — +:878+:019 | +°369+-048 
Muscularity Ae Dee +:639-+-038 -878-+-019 = |} +°199-+.-047 
Nutrition (Fat or Thin) +:451-+-043 +:369-+ -048 +°199+ -047 | — 


It will be seen at once that these correlations are of a very high order, much 
beyond anything we have hitherto reached. Indeed it is fairly clear that the 
Medical Officer’s general appreciation of health or musculavity tallies closely with 
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the prison official’s appreciation of the type of labour to which the convict can be 
set. We are justified therefore in considering how far these variates of physique, 
which clearly have intrinsic value far beyond the difficulties inherent in the personal 
equation of the medical examiner, are associated with the physiological variates. 


TABLE E. 


Correlation of Physiological and Physical Variates. 


Temperature Pulse Respiration 
General Health Bad +:108 a= -029 —-116 == 029 +:025 Se 029 
Capacity for Labour +-098+-028 | — 010+ -028 —-098 + -028 
Muscularity ... es +:126-+-030 —:179+-030 —-130+:030 
Nutrition Bee ae +:019+-028 +:093+-028 +:-096-+-028 
Height ie eae +003 + :022 — -078 + -022 —-144+4 -022 
Weight Rad sae + 043+ -022 +-114+-022 — 089+ -022 
PAS CMe re ues | —-150+ -022 +:121+-022 +:136+-030 


Now this table is really very noteworthy for all the correlations are low, even 
when significant. In other words the general physical condition has very little 
relation to the physiological variates. In a person, not ill enough for hospital 
treatment, temperature, pulse, and respiration would hardly be a differential 
measure of general health, much less of the goodness or badness of the physique 
in general. Itis true that the person in good health has a rather higher temperature 
and rather lower pulse-rate than one in poor or indifferent health. But there is 
no significant difference in respiration and the correlations are so low that not 
only no rough measures of temperature and pulse would aid diagnosis, but really 
fine numerical determinations would not be of any discriminating value. In 
capacity for hard labour the pulse plays no part, but it is associated with a slightly 
higher temperature and a slightly slower respiration. Muscularity is associated 
with higher temperature, slower pulse and slower respiration. Fatness with 
higher respiration and pulse, but has no apparent relation to temperature. Height 
and weight also have no sensible relation to temperature, the larger men have a 
slower respiration, but the effect of tallness is to slacken pulse, of greater weight to 
quicken it. Pulse and respiration quicken with age, but temperature falls. In 
every case, however, these associations are so small that they would be incapable 
of appreciation except as the mean results of large numbers of accurate records. 
For ordinary every day experience we can only conclude that nothing can be 
judged from the physiological variates of physique or from physique of the 
physiological variates. 


ae 
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(6) On the Relation of External Temperature to the Physiological Variates. 
In the memoir by Williams, Bell and Pearson, it has been shown that such variations 
as occur in seasonal change of external temperature do not influence the body 
temperature of one and the same individual*. The present material enables us 
to confirm this result for fairly long series of temperatures on different individuals. 
We have correlated external temperature with body temperature, pulse and 
respiration, and find the following values: 


Correlation of External Temperature and Body Temperature: — -002 + -023. 
Correlation of External Temperature and Pulse: 7108 + -023. 
Correlation of External Temperature and Respiration: 067 + -023. 


Thus body and external temperatures are to the extent of seasonal change unrelated. 
Pulse quickens and respiration quickens in rather warmer weather. But again 
these environmental influences are of a very unimportant nature, and the associa- 
tions are even less than those between the physiological and physical characters. 


(7) On the Individuality of the Physiological Variates. We have seen that 
temperature, pulse and respiration are variates, which have in reality very little 
association with physique as measured by such anthropometric characters as age, 
height or weight; nor are they, within the limits of non-hospital state, at all closely 
associated with general health or nutrition. Further they are practically unrelated 
to occupation (nature of labour) or environment (external temperature) as far 
as we have been hitherto able to measure these factors. Does this therefore 
signify that they are quantities rapidly varying, changed by a variety of causes 
not yet appreciated, and of no essential importance to the individual, as having 
no close relation to any of the bigger measurable factors of personality, physique 
or environment? The answer to this question is, perhaps, best sought by asking 
which of all the characters we have dealt with is most closely related to the physio- 
logical variates? There cannot be a doubt that the answer must be the mentality 
of the individual, and this factor of individuality we know to be one that is remark- 
ably persistent and personal. The majority of the weak-minded are congenitally 


- weak-minded, and but few of them really change their intellectual grade in the course 


of their life-time. Those who believe that intellectual power is the product of 
environment, or on the other hand that it is closely related to physique, or to any 
anthropometric measurement of the external body, have yet to demonstrate their 
position, and many facts have been collected which are markedly opposed to itt. 
There is much evidence to show that the chief mental characters flow from con- 
genital and hereditary potentialities. Can it be that these physiological variates 
have a like origin?’ The actual demonstration of this cannot, perhaps, be com- 
pleted on the basis of our present data, but very weighty evidence for its truth can, 
however, be deduced. We have already indicated that in the majority of cases the 
observation of temperature, pulse and respiration was repeated at an interval 
* See loc. cit. pp. 5-7. 


+ See R. S. Proc. Vol. 69, pp. 333-342, Vol. 71, pp. 106-114; Biometrika, Vol. m1. p. 39, et seq., 
Vol. v. pp. 105-146. Letters in British Medical Journal, January 27—March 17, 1906, etc. 
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varying from 2-3 weeks to 2-3 months. We are thus able to correlate first and 
second temperatures, first and second pulses and first and second respirations, 


and the results are of very marked significance. We find the following values 
of the correlations: 


First and Second Temperature: + -395 + -019*. 
First and Second Pulse Rate: + -644 + -012. 
First and Second Respiration Rate: + +856 + -006. 


The accompanying Diagram V indicates the distribution of observations of 
First and Second Respiration and their clustering about the regression line. 


DracramM V. Scatter Diagram and Regression Line for Respirations before and after an interval. 
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It will be evident at once from the above results that our physiological variates 
—above all respiration-rate, but markedly also pulse-rate—are essentially constant 


* The value found for first and second temperatures in the case of children by Williams, Bell and 
Pearson (loc. cit. p 10) was ++5238+-0346. 
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characters of the individual, characters which change only slightly with age, and 
which are only moderately linked with each other and with mentality. They 
are not subject to much influence from environment or occupation, and it appears 
to us that respiration-rate would probably form as good a mark of individual 
identity as most anthropometric characters and, since there is a relatively low rate 
of correlation between these physiological variates, their combined records might 
be even of value as a supplement* to finger prints in the problem of identification. 
The above values undoubtedly indicate that the degree of individuality which we 
find in the external measurable features of the body and in the mental characteristics 
is maintained, and just possibly exceeded, in the manner in which the various 
organs of the body carry on their functions; in other words the ‘“ individual” is 
not only characterised by an individualised framework and an individualised 
controlling faculty for that framework, but its metabolic processes are indi- 
vidualised also. The individual is such not only by his existence in space, but by his 
manner of existing in time. The individuality in the functioning of organs is as 
great as the physical differences in the organs themselves. In the future it may 
well be that heredity will be as effectively studied by measuring the functioning 
of organs as by measuring the organs themselves. The clinical thermometer 
and the sygmograph may gain new scientific value when they are introduced into 
the field of genetics. 

I wish, in conclusion, to express my indebtedness to Professor Karl Pearson 
for his continuous supervision throughout the course of the work and to thank 
him for many ideas and suggestions set forth in this paper. 


* Of course two persons will have the same respiration but hardly an absolute sameness of finger 
prints. What is meant here is that if finger prints have to be placed in definite descriptive ‘“categories”’ 
—a loop of so many ridges or a subclass of arches—these categories will hardly be more individual than 
the respiration-rate. Again a pulse tracing would correspond for individuality more closely to an 
actual finger imprint, than a pulse-rate which would represent a certain category of pulse tracings 
like a finger print class represents a certain type of finger prints. 
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ALPHABETICAL INDEX TO CORRELATION TABLES IN APPENDIX. 


Correlated Variates. 

Age: with Height, XXI; with Mentality, XLI; with Pulse, XXVIII; with Respiration, XXXI; 
with Temperature (Total population), XXXIV ; with Temperature (Normal-minded only), XX XVII; 
with Temperature (Weak-minded only), XX XVIII; with Weight, XX. 

Health: with Labour, XIV; with Muscularity, XV; with Nutrition, XLIX; with Pulse, VI; with 
Respiration, X; with Temperature, IT. 

Height: with Age, XXI; with Mentality, XVIII; with Pulse, XXVII; with Respiration, XXV; 
with Temperature, XXIII; with Weight, XVII. 

Labour: with Health, XIV; with Muscularity, XVI; with Nutrition, XLVIII; with Pulse, XXX; 
with Respiration, XX XIII; with Temperature, XXXVI. 

Mentality: with Age, XLI; with Height, XVIII; with Pulse, XXIX; with Respiration, XXXII; 
with Temperature, XXXV; with Weight, XL. 

Muscularity: with Health, XV; with Labour, XVI; with Nutrition, XLVII: with Pulse, VIII; with 
Respiration, XII; with Temperature, VI. 


Nutrition: with Health, XLIX; with Labour, XLVIII: with Muscularity, XLVII; with Pulse, VII; 
with Respiration, X1; with Temperature, IIT. 


Pulse: with Age, XXVIII; with Health, VI; with Height, XXVII; with Labour, XXX; with 
Mentality, XXIX; with Muscularity, VIIT; with Nutrition, VIL; (Occasion A) with Pulse (Occasion 
B), XIX; with Respiration, V; with Temperature (Total population), XXXIX; with Tempera- 
ture (Normal-minded only), XLII; with Temperature (Weak-minded only), XLIII; with 
Temperature (external), XLV; with Weight, X XVI. 

Respiration: with Age, XX XI; with Health, X; with Height, XXV:; with Labour, XX XIII; with 
Mentality, XXXII; with Muscularity, VIII; with Nutrition, XI; with Pulse, V; (Occasion A) 
with Respiration (Occasion B), IX; with Temperature, I; with Temperature (external), XLVI; 
with Weight. XXIV. 


Temperature: with Age (Total population), XXXIV; with Age (Normal-minded only), XX XVIT; 
with Age (Weak-minded only), XX XVIII; with Health, II; with Height, XXIIT; with Labour, 
XXXVI; with Mentality, XXXV; with Muscularity, IV; with Nutrition, III; with Pulse 
(Total population), XXXIX; with Pulse (Normal-minded only), XLII; with Pulse (Weak- 
minded only), XLIII; with Respiration, I; (Occasion A) with Temperature (Occasion B), XIII; 
with Temperature (external), XLIV; with Weight, XXII. 

Weight: with Age, XX; with Height, XVII; with Mentality, XL: with Pulse, XXVI; with Respira- 
tion, XXIV; with Temperature, XXII. 


VARIATION AND CORRELATION OF THE NUMBER OF 
UMBEL RAYS OF SOME UMBELLIFERAE*. 


By C. COOL anp A. N. KOOPMANS, 


Laboratory of Microscopical Anatomy, Technical High School, Delft. 


(1) Since the investigations by Ludwigt into the variation of the number of 
umbel rays of the umbels of Umbelliferae, but little is to be found about this subject 
in biological literature, although the reading of the chapter “Summationskurven 
der Umbelliferen” of the article above-mentioned might suggest various problems. 


For example, in the variation curves of Ludwig the numbers of the Fibon- 
nacci series do not exclusively appear as maxima, but in some cases also the 
numbers of the first parallel series (1, 3, 4, 7, 11, etc.) appear. Voglert also finds 
in his investigations into Astrantia major that principal series as well as parallel 
series occur in this case. 


The few investigations by Ludwig on the Umbelliferae are hardly sufficient 
to allow us to state that modes which arise in the numbers of the first parallel 
series of these variation curves are with any degree of certainty only apparent 
modes, or to assert that they have resulted from the mixing of plants of different 
local races. On the contrary it is very probable that these numbers of the first 
parallel series are a characteristic phenomenon of the variation curves of the 
number of umbel rays of many species of Umbelliferae. 


Of the many countings which we made in the summer of 1912 to find out 
something more about this subject, only those relating to three species of plants, 
Pastinaca sata, Anthriscus sylvestris and Scum latifoliwm, are used here. 


Only numbers of umbel rays of terminal umbels and those of lateral umbels 
of the first order were counted. 


* Our thanks are due to Prof. Dr G. van Iterson who kindly revised this paper. 
+ Bot. Centralblatt, Bd. Lxtv. 
+ Bethefte zum botanischen Centralblatt, Bd. xxiv. 
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Fic. 1. A represents an umbel with one fasciation of two rays which are united to the top, B an 
umbel with one fasciation of two rays which are not united to the top, C an umbel with two 
fasciations each of two rays with union somewhat lower. 


Fic. 2. D represents an umbel with two fasciations, one of two and the other of three rays, H an 
umbel with one fasciation of four rays, F an umbel with three fasciations each of two umbel rays. 


Special Fasciations of Pastinaca sativa. 
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(2) CountTiIneGs oF PASTINACA SATIVA. 


Variation of the number of wmbel rays of the terminal umbel. 


In Table I are arranged, according to the increasing numbers of umbel rays, 
terminal umbels of two groups of plants growing in different places; the first group 
of 341 plants came from clayey and peaty waysides in the neighbourhood of Delft 
(Holland), the second of 335 plants for the greater part from sheltered, for the 
smaller part from exposed spots in the duneland not far from the village of Lisse 
(Holland). The countings were made in the months of July and August of the 
year 1912. 

In counting, a large number of fasciations were observed in the umbels. Among 
515 plants, of which both the terminal and lateral umbels were counted, 167 were 
observed to have fasciations. As many rays were taken into account as there 
would have been, if no fasciation had occurred. Plate I, figures 1 and 2, shows 
some very interesting fasciations. 


Variation of the number of wmbel rays of the lateral umbels of the first order and 
correlation of the numbers of umbel rays of terminal umbels and that of lateral 
umbels. 

The countings of umbel rays of lateral umbels have been combined in two 
different ways. In the first place lateral umbels belonging to plants with the 
same number of rays in the terminal umbel have been united into groups; and in 
the second place those which belonged to plants with the same number of lateral 
umbels. . 

In Table IT, 3836 lateral umbels of 515 plants have been divided into groups, 
each of which has been placed in a horizontal row and contains all the lateral umbels 
of plants with the same number of rays in the terminal umbels. 

The mean number of umbel rays of the lateral umbels is greater than that of 
the terminal umbels. For that reason terminal and lateral umbels are considered 
separately. 

In Table III the lateral umbels are divided into groups in such a way that all 
the lateral umbels of plants with the same number of lateral umbels are arranged 
in one horizontal row. 

The distribution of the number of umbel rays of the 3836 lateral umbels is 
represented graphically in Fig. 3. 

The distribution, which is given in the first and the last column of Table ITI and 
which gives the number of lateral umbels per plant, has not been studied in detail 
because it is of no importance for this investigation. The constants 6, = 0-144 
and B, = 3-485, the “critical function” = 0-208 and the curve which will fit best 
to this distribution is a Pearson curve of Type IV. 
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If we consider the different distributions in the horizontal rows of Tables IT 
and III in which the maximum numbers are printed in heavy figures, we become 
convinced that besides the numbers of the principal series such as 8 and 13 (and 
multiples of numbers of this series such as 15 and 16), also numbers of the first 
parallel series such as 11 and 18 (and perhaps multiples of numbers of that series, 
14 for example) are not infrequent. 


Frequency of 
the lateral wmbels 


Fia. 3. Pastinaca sativa. 


It is certainly very remarkable that with the increasing number of rays of the 
terminal umbel or with the increasing number of lateral umbels per plant, the 
lateral umbels that occur as maxima in a horizontal distribution possess a number 


of umbel rays that leaps up from 11 in one to 13 in another horizontal distribution ; 
in other words: 


The number of umbel rays 12 does not occur as a maximum in any of the 
horizontal groups-of the two correlation tables. 
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(3) CounTINGs oF ANTHRISCUS SYLVESTRIS. 
Variation of the number of umbel rays of the terminal umbel. 


A little preliminary investigation showed that the number of umbel rays of the 
terminal umbel of Anthriscus sylvestris is very small and that in most cases this 
umbel consists of 1, 2, 3 or 4 umbel rays and in some cases of 5, or is entirely missing. 

Afterwards four countings were made: 

(a) On 502 plants in the neighbourhood of Beverwijk on May 16th, 1912. 

(b) On 305 plants in the neighbourhood of Lisse on May 26th and July 16th, 
1912. 

(c) On 77 plants partly in the neighbourhood of Lisse, partly in that of 
Bentveld, on July 6th, July 19th and July 27th, 1912. 


(d) On 46 plants from the Schapenweg in the neighbourhood of Rijswijk on 
May 19th, 1912. 


The result of these countings of the number of umbel rays of 930 terminal 
umbels is given in the following table. 


TABLE IV. 


Variation of the number of umbel rays of the terminal umbel of Anthriscus sylvestris. 


Number of umbel rays] g 1 9 3 4|\ 5 | 6 |) 7| 8 | 9:3\7onletae 


of the terminal umbel 


Number of terminal 9 | 107 1196 | 143 | 40 | 4 Ta ies eal eee ste 
umbels. Group (a) 


Number of terminal < ia | | 
| umbels. Group (6) 16 | 107 |112 | 49] 14] 3 3 1 


| Number of terminal : 
| umbels. Group (c) Te 2260023" 23a tall 


Number of terminal 
umbels. Group (d) 13) 15 Se |) zh i) oT 


Number of terminal 
aeaelameMatale 26 | 253 | 346 | 228 | 59°) S79) 3 ) Sb | — | 1 ] 


Mean of umbel rays for 930 terminal umbels =2-125. 
8.D. of umbel rays for 930 terminal umbels =1-119. 


The graphical representation in Fig. 4 of the distribution in the last row of 
Table IV is an asymmetrical polygon with mode at 2 umbel rays. The point 
which is obtained by taking the terminal umbels with 0 umbel rays into account 


ede 
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accords with the one that we can expect when we know only the other points and 
causes the variation polygon to become rather symmetrical with regard to the 
vertical through the mode. 


Frequency of 
the terminal umbels 
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Fig. 4. Anthriscus sylvestris. 


Variation of the number of umbel rays of the lateral umbels of the first order and 
correlation of the number of umbel rays of terminal umbels and that of lateral umbels. 


Of 235 plants (being the same series for which the rays of the terminal umbels 
were counted and which were included in the preceding distributions) the number 
of umbel rays of the lateral umbels was determined. 


Again these lateral umbels were divided into groups, in the first place into 
groups belonging to plants with the same number of rays in the terminal umbel, 
and in the second place into groups belonging to plants with the same number of 
lateral umbels. 


In the following table have been assembled the results of the countings on 
1632 lateral umbels of these 235 plants in which the first sub-division was taken. 
It is easy to see that the numbers 8 and 10 are predominant. 
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TABLE V. Correlation of the number of umbel rays of terminal umbels and lateral 
umbels of Anthriscus sylvestris. 


Number of umbel rays of the terminal umbel. 


Number of lateral umbels with Bedi | INaene | 
3 od 5 6 | 7 | 8 9) | 20 | 12 | Le 5) 13 | 4S |) LON L7 WL Sal eto of ber of 
belie: lateral | plants 
umbel rays Late 
| | 
0 — | Boh 26425 5|_ eile 13s 03h 2 | | 71 9 
1 2) .23') 27 | 37 | 55 \135)) 119)) 107) 440) U7 1|/—]|—/]—/]—|]—|]—I] 567 77 
2 | 3! 5/15 | 92 | 43 |408| 97/127) 73 | 49 | 15) 5 | =: | 5) 2) Se eee 
3 1 3 6) 7]|19]| 30] 45] 82) 68 | 30 | 24 7 2). 32) =) sao 50 
4 _ i 4 2, 3 Salil ih 4 3 1);—) —}]—}]— 50 9 
5 1 1/—/|—] 2 5| — DAD 1 1 2 1 | = 1;/—|— 22 3 
6 — 1/—| 3 2, 3 ] Tey) 1h |) 1 13 2 
7 —|— Lh al 1 2; 1 4, 2 1 1 1 1 1 1 1 1 20 3 
| | | 
Totals] 7 | 36 | 55 | 75 | 137 |804 | 279 |344| 212}105/ 47 | 18 | 5] 4 2 1 1 | 1632 235 
Mean of umbel rays for 235 lateral umbels =9-127. 
8.D. of umbel rays for 235 lateral umbels =2-150. Coefficient of correlation =0-3033. 


The mean number of umbel rays of the lateral umbels is much greater than the 
mean number of rays of the terminal umbels. For that reason terminal and lateral 
umbels were considered separately. This great difference of the average numbers 
to a certain degree accounts for the fact that the coefficient of correlation is low, 
much lower than with Pastinaca sativa. 

The second sub-division of the lateral umbels in groups belonging to plants 
with the same number of lateral umbels gives the following result: 


TABLE VI. Correlation of the number of lateral umbels per plant and the number of umbel 
rays of lateral umbels of Anthriscus sylvestris. 


Number of lateral umbels per plant. 


Number of lateral umbels with Dec Name 

8| 4) 5| 6| 7] 8 | 95) £0) TP o\12°| 13> 142159) 16 | 274) tsa rg of ber of 

; lateral | plants 

umbel rays Bel 

[=| ] rerprareral era 
Pais) Wee i ae meee es | = 2 1 
5 | | 2 1 3 1 
4 Loy 3.) 225 air 2 — 8 2 
| aa eae ey els). eel eee OR ey i Gy sh 1 1 I 1] 115 23 
6 |—] 3| 5] 6/| 28 | 56) 61 | 74) 48) :34) 08) Gy) 25) a i roe 57 
Ui Y | U1 ) 22°) 27.) 41 1125) 98 11'9)) 699) 800585) eo 79 
8 4 | 10| 12 | 23 36 | 55 | 54 | 75 | 47] 22/15) 9) 2) 3 | —|—|—Ff] 367 46 
{ — |} 8} 11 | 10 | 15 | 32] 36)43) 17) 7 1}; —]|—}—]—]—]—F] 180 20 
2; 3; 2] 4] 6/15) 7} 8] 8)—/—)|]—)/;— 50 5 
i P24 | ee alll 13 1 

a eee af eee ae ced cee etl pet oc) asa] an ani mane | Cees | rca (ime | eee | eee | ea 
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In this table also the numbers 8 and 10 are conspicuous just as in the sub- 
divisions not reproduced here of each horizontal row in the above table, which 
were made according to the number of rays of the terminal umbel. 


It is remarkable that there is not a single case of Anthriscus sylvestris in which 
a number of 9 umbel rays in a horizontal row of one of the Tables V and VI occurs 
as a maximum, whereas the numbers 8 and 10 not infrequently occur as modes. 


Frequency of 
the lateral umbels 
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Fie. 5. Anthriscus sylvestris. 


Figure 5 gives a graphical representation of the variability of the number of 
umbel rays of these 1632 lateral umbels. 


(4) Countines or Siva LATIFOLIUM. 


Variation of the number of umbel rays of the terminal umbel. 


The number of plants examined of Stwm latifoliwm was much smaller than 
the number of plants of the preceding species, which were submitted to examina- 
tion. Only in the case of 127 plants were the umbel rays of terminal umbels and 
lateral umbels counted, and of one only of the umbel rays of the lateral umbels, 
because the terminal umbel was missing. 
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The distribution of the numbers of umbel rays of 127 terminal umbels was as 
follows: 


TABLE VII. 


Variation of the number of wmbel rays of the terminal umbel of Sium latifolium. 


umbel 


Number of | | | 


of the ter- 
minal umbel 


Number ofter-} . 
minal umbels 


| | | 


T9Y8 | 74| 15 | 16 | 17| 18 | 19 PARTIES 23| 24|25| 26! 27 | 28| 29/30 | 31| 32 aa 35 | 36 | 37 


Total 


‘| 127 


Mean of umbel rays for 127 terminal umbels = 25-638. 
S.D. of umbel rays for 127 terminal umbels= 5-948. 


Conclusions could not be drawn from this distribution, the number of variates 
being too small with respect to the great range of variation. 


Variation of the number of umbel rays of the lateral umbels. 


The lateral umbels again were sub-divided into groups belonging to plants with 
the same number of rays in the terminal umbels and into groups belonging to plants 
with the same number of lateral umbels. As the former groups were very small, 
they can give little or no information about preference for certain numbers. The 
latter groups have been united in Table VIII. 


Though the small number of umbels, of which the rays were counted, and the 
great range of variability make it necessary to be prudent in drawing conclusions, 
we think it probable that not only the numbers of the principal series are of 
importance in this case, but also other numbers as for instance 18 and 29. 


Correlations. 


The coefficient of correlation for the number of rays of terminal and lateral 
umbels was as follows: 


For Pastinaca satwa ... aE 0-7269 + 0:0051. 
» Anthriscus sylvestris a 0-3033 + 0°0152. 
, Stum latifolium  ... a 0-8693 + 0°0070. 


It is remarkable that this coefficient of correlation is different for these three 
species of plants, and it would be worth examining if this coefficient has a constant 
value within certain limits for different local races of the same species of plants. 
In that case a number would have been found, characteristic of each particular 
species. 
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Biometrika x1 


ON THE PARTIAL CORRELATION-RATIO 
PART II. NUMERICAL 


Being Part II of a Thesis for the degree of D.Sc. in the University of London 
By L. ISSERLIS, B.A. 


1. The object of the present paper is to provide numerical illustrations of 
some of the results in the first part*. Suitable material is not easily obtained, 
as many tables dealing with three variable characters are artificially curtailed as 
regards one or more of them. Iam indebted to Miss Ethel M. Elderton for having 
kindly placed at my disposal manuscript tables of the heights to the nearest inch 
and weights to the nearest pound of the boys in group A (comprising schools in 
the poorest parts of the city) of her investigations into the heights and weights 
of school children in Glasgow f. 


I have arranged the material in tables which differ from Tables I-IX of Miss 
Elderton’s paper in the following respects: 


(i) A uniform grouping of 5 1b. intervals for weight and 3 inch intervals for 
height is adopted at all ages; 


(uu) 384 boys of central age 5 (ie. ages 4:5 to 5-5 years), omitted in 
Miss Elderton’s tables, are here included, and two boys (age 6-5 to 7-5 years) 
whose height was below 21-5 inches, included in her tables, are omitted here. 


These tables are cut off artificially by the regulations of the School Board with 
regard to school leaving age and the admission age of infants. 


Tables I-X contain the data, the frequency distributions of height and weight 
being given for each of the ages from 5 to 14 of the 11,382 boys. 


* “On the Partial Correlation Ratio,” Part I, Theoretical, by L. Isserlis. Biometrika, Vol. x, 1914, 
pp. 391-411. (The title of the first part was badly chosen, the generalized H there introduced is a 
multiple one, providing a measure of the dependence of one variable on two or more other variables. 
An example of a partial 7 would be ,7,.,, being the correlation ratio of z on y for a given value of 2. 
Pearson has recently published very simple formulae connecting the partial correlation ratio with the 
multiple correlation ratio: see R. S. Proc: Vol. 91, A, pp. 492-8. They suffice to show, that a 
knowledge of the multiple correlation ratios leads at once to the partial correlation ratios, and to that 
extent justify my title.) 

y “Height and Weight of School Children in Glasgow,” by Ethel M. Elderton. Biometrika, Vol. x. 
1914, pp. 288-339. 
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Tables XI, XII, XIII give the distributions of height and weight, height and 
age, and age and weight for the whole of the individuals in the preceding ten tables. 
They are summaries. The direct calculation of one of the three generalized H’s, 
say that of age on height and weight, requires the use of Tables I-X, while the 
various approximations given in the paper cited can be obtained from the 
summaries in Tables XI—XIII. 


I have calculated the correlation ratio of age on height and weight, and that 
of weight on age and height directly, and compared them with the values obtained 
as approximations from the summaries. 


The direct calculation is laborious and the results show that the various 
approximation formulae are in general sufficiently accurate to enable us to dispense 
with this heavy labour. 


Throughout the present paper, I denote the age by ‘z’ with an arbitrary origin 
at 10 years, the height by ‘x’ with the origin at 49 inches and the weight by ‘y’ 
with origin at 56 lbs. 


2. With the arbitrary origin at 49 inches, the raw moments for the height 
frequencies are: 
C—p,—— 05113861", 
Pp e= 3453699, 
Pp g= — 4938324, 
Dp —- 00 ONS OL. 
Thus the mean height is 49 — 3 (0-511861) = 47-46 inches. 
The moments referred to mean are given by the formulae: 
Par = Per — (p') = 3191697, 
Da = Po — 3p' 2 Pq + 2p’, = 0-096908, 
Pat = Po — Ap’ P'e + Op ep’? — 8p'a* = 25-7912. 


Dye : 
From these Py = —, = -0003, 


The standard deviation is o, = 1:78653 in 3 inch units, or 5-36 inches. 


For weights the arbitrary origin is at 56 Ibs., and the raw moments are: 


y= p’, = — 0-205412, 
pp = 6772096, 
py’ yp= 5979090, 


py = 134-637146. 


* For the sake of uniformity the constants in all intermediate results are given to six places of decimals. 


4—2 
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So that the mean weight is 56 — 5 (0-205412) = 55-97 lbs., and the moments 
referred to the mean are: 
Dy 6-729902, 
Dy = 10-134965, 
Py = 141-258877, 
giving Wp Oo tOs Aehy SS EIU). 
The standard deviation is o, = 2°594205 in 5 lb. units, or 12-97 lbs. 
With arbitrary origin at 10 years, the raw moments of the ages are: 
Z—= pp, — — 0-o6celh: 
p2= 6402214, 
pp = — 9423124, 
p'a= 80-916710. 
The mean age is therefore 10 — 0-568617 = 9-43 years, and the moments about 
the mean age are: 


I 


I 


Pz = 6078889, 

Pa — 1-130401, 

pe = T1591 474, 
leading to on — 00D I. Pe = 179374; 
while the standard deviation is o, = 2-47 years. 


3. The mixed moments pz,, Px,, etc., can be found from the correlation tables 
XI-XIII. The values of the raw moments required are as follows: 


Psy =  4:311017, Pm =  3°949042, 
P' wy = — 1:066245, Po = — 7108065, 
Pay = 74:045335, Pez = 50313120, 
Py = — 3°712880, D a%z = — 5488842, 
Paty = 35°657353, D' win = 24225619, 


P ay = 5430241,  p' g2y2 = 47-927956, sp’ ay, = — 4341768, 2,2 = 37-510631. 
The corresponding moments about the mean are: 
Day =P ay —2Y = 4205875, 
Py; = 5313440, 
Pzx = 3°657989, 
Pay = P ay — 2YP' xy — EP yp + 2xy* = 4-128088, 
Pays = P wey a BYP oy oP BPP ay <. LD ys ar BLY 2 = 79-144020, 
Paty? = P ary? ram 29D a2y aie 2ED" wy? st: PD a ae Cp yp ar AY D' wy oa S07y? 
— 49-011115, 
Poyz = P nye — XYZ — BPyz — YPexw — 2Pxy = 1580733. 
Similar formulae give: 
D8, = 302279, (Oe | OPAISICRS), 
Pay = 33°336761, Putz = 19-108627, 
Det" a2090N, Pues = 42°785163. 
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75,7570 
4. Further we take: Qaly"2" = ation 
Ty Tyro,” 


We now collect into a single table the values of the reduced moments, the 
correlation coefficients and the correlation ratios. 


TABLE XIV. 

Gen -157282 Tey ‘907489 | = — 
Gap 343344 Ma -918157 ie — Peay -019478 
Gai 2-281729 My -915430 amy — Pony -014476 
on 2-253653 == = = = 
Ge 2:537432 ys -830729 = a 
ds 016995 Me “851196 ys — Pay 034424 
des 2-531793 me -833060 ny —? one -003878 
Vy -580509 sone = — ame 
Ay 3-118877 ee -830463 = == 
Gas -027925 Ns -844792 ats — Poon -024005 
(es -030294 Ne -831447 Mn — een -001635 
est 1-597889 = ea = -_ 
Gs 1-359207 = = = == 
x22? 1:605274 = = a —— 
ay elggesot |) = | = — 


It will be observed that the differences .»7, — 7?,, and 7, — 7?,, are very small, 
showing that the regression lines of weight on age, and of height on age are nearly 
straight. The six empirical regression lines are given in Figs. 1-6. 


Fic. 1. Regression of Age on Weight. 


¥. 
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Fic. 2. Regression of Weight on Age. 
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Age in years. 


Fic. 3. Regression of Height on Age. 


& 
ice) 


Mean Height 


Height in inches. 
ae 
[op] 


as 
Oe 


p 
o 


oF 5 i Gis sl OG an en Se 


Age in years. 


Age in years. 


Weight in Ibs. 


31 


L. [SSERLIS 55 


Fic. 4. Regression of Age on Height. 
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= 


4 87 40 48 46 49 52 55 58 61 64 


Height in inches. 


Fic. 5. Regression of Weight on Height. 


Mean Height 


Mean Weight 


34 37 40 48 46 49 52 55 58 61 64 


Height in inches. 
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Fic. 6. Regression of Height on Weight. 


s 
D 
% 
= 
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© 
= 


5 
49 Mean Height. 


Height in inches. 


26 31 36 41 46 51 56 61 66 71 76 81 86 91 96 101 106111 116 121 
Weight in lbs. 


5. The mean ages of the a, y arrays are given in Table XV. From this table 
H?, is calculated directly by the formula: 


xy z 


rier 2 
ny ll?, —i98 {ay (2 Zany) } 


No,? 
= oligos 
or Sora 
liane ag teen ZN ye Vz00T xn 
ee Ut ee ee 
= 1233 
and pity 81505), 


We see that the regression of age on height and weight is very far from planar. 

The difference 
pyle — pyle — 10280. 

We have at present* no formula for the probable error of the multiple correlation 
ratio, but some idea of its magnitude may be obtained from Blakeman’s formula t 
for the probable error of ,7?, — Pry: 

oe ay Ses VN. 4 : / xn? — 7? 
Probable error of (47y— Toy) 674491 + (1 — 7)? — (1 —?)2 


* Since writing the above, the author has obtained a formula for the p.e. of ,,H,. It is very 
similar to Pearson’s well known formula for the p.e. of the simple 7. 
+ J. Blakeman. Biometrika, Vol. Iv, p. 332. 
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With the above values of ,,f?, and ,,R?,, the right-hand side of Blakeman’s 
formula reduces to 13-47, so that on this assumption ,,H?, — ,,R?, is 13 or 
14 times greater than its probable error. 


If we examine the regression of weight on height and age, we find in the same 
way that: 


wnll?,, = -8634, wall, = +9292, 
Rt, = -B4970 sghy = 9180, 


enll?, — ,,R?, = -0207. 


In this case too the regression is significantly non-planar, Blakeman’s formula 
reducing to 11-43 so that ,,H?, — ,,k%, is more than 11 times greater than its 
probable error. 


Physically ,,H, is more interesting than ,,H,. A knowledge of the former 
enables us to construct by means of equation (53) of my Part I, a formula 
predicting the mean weight of a group of boys of given age and height. Such 
a formula would be an alternative solution to the one given by Table a on 
p. 300 of the paper by Miss Elderton referred to above. 


6. We may consider now equation (70) of Part I which gives the approximate 
age of ,,H?, — »,R?, in terms of constants all derwable from Tables XI, XII, XIII, 
i.e. constants whose determination does not require a detailed knowledge of the 
distribution. The equation is: 


(ene ay.) Gaye — 1 


2 
Ga2y2— Tey 


i W , yr ce ed 2 

Voy Vary — Tar Vyz — Taz? ay Gn, — 7) — (V yz Vyal ey (72 — Pay) 

= . ]—r yz zy 1— Pr yx Pay 
Vx2yl xz — Tay?" yz xy ry 


; sae as e N28 
TaeyCay? — Vary Voz — Vyzl ay 2 2 Vyz — VazV ay 2 2 
Seay eae Wi) ae : E528 i (Ge) 
ay 


5 ; 2 
Vay? yz — Va2yl az sale ay 


When ,f,, ,8,;* are very small and ,,, .n, and r,, are nearly equal, we use 
instead of (70) the following equations: 


2 Re = tee Moy ( Cy a) 71 
pe? Fe CPS Gace 1 yVz U op) Goo udonuneeuuddces ( ys 
FH, — ,4R?, = 22 —# (at, — Pea) (72) 
Cyn tZ CY ee dazu y ele ) ee 4), 
ay 
or approximately 
: Sie cia oy ee ; 
ea kes a Byles ad D2 = Gas 72. 43) Piiccrrrrer err Chal). 
al ee) 
Le ; 
py oy leg — 32 SUA eta) os iewccecnwosetchtinds Gia). 


* ,8, is used to denote the f, of the frequencies of the x character, 
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For the regression of age on height and weight, the values of ,,H?, — 
and of ,,H?,, given by the preceding equations, are tabulated below: 


TABLE XVI. 

Source ay”, a ayh?, ay”, 
(70) 0303 ‘7537 
(71) 0392 ‘7625 
(71) 0381 ‘7615 
(72) 0273 ‘7507 
(72) 0266 7499 
pee: 0286 ‘7519 


Calculation / 
We notice that (72) and (72)’ give much better values than (71) and (71)’. 
This is to be expected for ,8, is very small with a value of -0003, while ,f, is 
quite appreciable with a value of -3370, and since 7,,, ,Y2» ay are pretty nearly 
equal (cf. Table XIV), equations (72) and (72)’ may be used to replace (70). 
Another point of interest is that there is very little to choose between the 
values given by the cumbersome formula (70) and those given by (72) and (72)’. 
If we now consider the regression of weight on height and age, we find that 


equation (70) is unsuitable for the calculation of ,,H?, — ,,R?,. 
For in this case aPy = °0003, 
Px = +0057, 
ay eS 
Me = 8314, 
fi == OoUDs 


and we expect the right-hand side of (70) to be indeterminate (cf. Part I, p. 406). 
We find 


leat — W2a = — -007104, 
Vradex i Vea? i -002767, 
and lay WVen =z Tey Tzu? — ‘000175. 


Thus the right-hand side of equation (70) depends on the ratio of quantities 
which practically vanish when their probable errors are taken into account. 

If, nevertheless, (70) is employed, it gives ,,H?, — ,,R®, =-1580, and since 
oh, = 8427, we get ,,H?, = 1-0007!, i.e. a value just greater than unity*. 


The values of ,,H?, obtained from (71) and (72) are tabulated below: 


TABLE XVII. 

Source NEED sd onl? y 
(71) 0054 848] 
(72) -0203 -8630 
(71)’ 0048 84.74. 
(72)’ 0177 8604 

Direct : ; 

Gariation) °0207 8634 


* Of course by an amount < p.e. of ,,.R,. 
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In this example also as ,8, is much nearer to zero than ,f, we get better values 
from (72) and (72) than from (71) and (71)’. 

We may conclude from these numerical cases that, if 

(i) ,f, and ,B, are fairly small, and 

(ii) the regressions of y on x and that of « on y do not depart widely from 
linearity, we get very fair values for ,,H?, from an equation of type 


2 2 
eee ioat alte ae Gee i 5) 


2 
2 Vary ~ Tay 2 2 
By a he <a syles —. uae a (7 Pad} aa 


and that it is better to use the equation which corresponds to the smaller 6, than 
to use the mean of the values given by the two equations of each type, i.e. if ,B, 
is smaller than ,8, we use the formula in terms of ,7?, — 7*,, in preference to the 
one involving ,7?, — 7%,, and in preference to the mean of the two. 

If the conditions (i) and (ii) are not satisfied, we use equation (70), which requires 
more labour than (71) or (72), but is much easier than a direct calculation. 


7*. Itisa matter of interest to see whether the regression surface of age on height 
is of the type considered in Part I of this paper. The regression surface being 


Zey —2 _ gg | a(v—z)  b(iy—y) } c (x — &) (y— 9) 


i= 


O, Ox Oy O7o0y 
ee\2 2 
+ ee A gh ie crag (28). 
The solution of equations (29)-(34) is: : 
d = -060374, c = — :0940467, 
a = -360742, e=  -072020, 
b = -547960, f = — 040709. 


Here ¢ is decidedly smaller than a and 6, but e and f are of the same order 
asc. This seems to suggest that our formulae have a wider range than the type 
of surface by which they were suggested. If we solve equations (36) to (39) 
(Part I) in which the coefficients e and f are assumed to be zero, we obtain 

d = -060172, b= — -512257, 
a = -376024, ¢ = — :066306. 
On substituting these values in (59) and (60), we find 


or olals eat, 
Now equations (59) and (60) were based on the assumption that the arrays of 
y’s for constant « and the arrays of a’s for constant y are homoscedastic. 


* This section differs from the preceding—it does not deal with genuine approximation formulae— 
for in the determination of the constants a, b, c, d we use quyz, i.e. the detailed table. It illustrates 
some other points in Part I. 
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It is easy to remove this restriction. As on p. 402, we write 


a~a\ 2 pee y 
gen a ht oe one — SY |(« aig <u 5 | 


( Oy 
Yi 2 Y 2 Oy . 
= a (1— yn, )+ 2ac 8, (2 a i] + 2S, (& eaty a ee (59). 


2 
Pca Ga o2naNh 


2 
: : Oa, 9 
If the x arrays are homoscedastic —~’ = 1 — ,7?, and we get (59). 
Ox 


Similarly equation (60), corrected for heteroscedasticity, becomes 


nope 2 2 
Dees Sal ap) bas @ = 7") + 28, ( a > =) See (60)’. 
Applying these corrections to the case of ,,H*,, we find 
ayll®, — *724535 = -0221969 + -0003345 + -0006875 ............ (59)’, 
or eilelen eT Aallis) 
and ay”, — °713674 = -0424845 — -0053862 + -0010073 ......... (60)’, 
or eee OLS 


showing the separate contributions of the various terms. 
The agreement of (60)’ with the empirical value -7519 is remarkably close. 


For the regression of weight on height and age, equations (36) to (39) give the 
values 
a = -24570, b = -70004, ¢ =-12160; 
if the regression hyperboloid be written in the form: 


YUrx — U Z2—Z L— 2B L—- 2) (2-2 
Yen dE ae +b ee ) ) 
Oy Oz Ox Ox OG, 


The values given by the equations corresponding to (59) and (60) are 
ally = any + (@ + Ce) (1 — xn.) = -838012 + -021519 = +8595, 
aatl?y = any + (& + ce) (1 — 2777.) = 693989 + -15580 = -8498, 
while the equations corresponding to (59)’ and (60)’ give 
zl”, = °838012 + -0172780 + -00011322 + -0034254 = -8588, 
and zall*y = °6939891 + -15128 + -0048971 + -0091002 = -8593, 
the empirical value of ,,H”, being -8634. 


These approximations are not quite so good; they are bound up rather closely 
with the assumed hyperboloidal form of the regression surface, while Mr Soper 
has shown that in the case of weight on age and height, a good fit is obtained by 
a surface of the form: 

Yeu = Uy + yz + Age? + (by + Biz + B22) w*. 


8. Mr M. Greenwood, Jr, has kindly drawn my attention to a joint paper, 
entitled “A Study of Index Correlations,” published by Mr J. W. Brown, Miss 
Frances Wood and himself in the Journal of the Royal Statistical Society t. 


* Elderton’s paper, loc. cit. p. 294. 
+ Vol. uxxvn, Part ILI, February, 1914, pp. 317-346. 
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The authors give the value of a generalized H, obtained by direct calculation 
in two or three cases. The material they deal with is the distribution of births, 
deaths and populations in 1000 English registration sub-districts, and also of the 
birth- and death-rates and populations in these districts. The material is highly 
heterogeneous and exceedingly high values of B, occur. Thus, 8, for deaths is 
16-0205 and for death rates is 61-5392. 


Unfortunately in their table on page 341 of the values of the simple 7’s, the 
authors fail to discriminate between 7, and ,7,. 


Mr Greenwood, at my request, was good enough to consult his ms. notes and 
sent me an identification table of the y’s, but a recalculation of the 7 of populations 
on births and the y of populations on deaths has shown that errors have crept 
into this identification table. 


When amended this table is as follows: 
i OU 
Mn —-Jo08, 
a ole 
Sip lela 
Oa 
7 8871, 
where «= population, y = births, z = deaths. 
With these values our equation (70) gives 
gy, ar aylve = 2691, 
or ayllt, = 8014, 
and pyle. = 9008. 
The authors give as the empirical value ,,H, = -9384. 
As regards the use of the simpler formulae, the only 8, of reasonably small 


value is 8, of population which equals -0886, 


22, 
We put py = Gye = at = (he rer) 
a Ly 
. 5 )\2 
or wvll?, — (7825)? = Daa [(-9241)? — (-7822)°], 
or nyHT2, = 87200, 
oH, = +9338. 


Considering the wide divergence in the results obtained by the authors when 
dealing with the 1000 sub-districts and with 999 of them* it is evident that the 
use of the above simple formula would have been sufficient for their purpose. 


* 1.¢., Pp. 343: population and births Hacatns =*9384 for 1000 sub-districts, and =-9213 for 999 sub-districts. 
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GLASGOW SCHOOL BOYS—GROUP A. 


TABLE I. Age 4:5 to 5:5 years. 
Weight in lbs. 
| 2428 | 29-33 | 3438 | 39-43 | 4448 Totals 
| 30—32 2 Zs pea = = 2 
| 38—85 5 15 5 — — 25 
36—38 1 18 72 8 — 99 
39—41 = 5 87 90 7 190 
42.44 = sat 4 35 21 65 
| 45—47 _ a I = 2 3 
. | 

Totals 8 38 169 133 30 384. 

TABLE II. Age 5-5 to 6-5 years. 


Weight in lbs. 


TABLE III. 


Age 6-5 to 7-5 


Weight in lbs. 


years. 


Height in inches 


Totals 


Height in inches 


Height in inches 


Height in inches 
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TABLE IV. Age 7-5 to 8-5 years. 
Weight in lbs. 
oD oe) oD 29 oD °9 A) co Oa) eo) i; 
oD oD “H “SH > ’%s Ne) So ~ ~ 
| | | | | | | | | [Totals 
i oD oD “H SH Lm) ie) Ne) ©S ~~ 
hemechise eee | | 
BB 85 3 1 | | 6 | 
36—38 13°9)\ 15 3 1 32 
CO =? C88 || G3) £18 2 121 
42-44 i etSh4 146. | 2579 |) 7d 1 491 
Ze Vee 2 | 36 | 204 | 283 | 89 4 (deen eee ea 
ome | eS 13 2 | 86 |) 40 Tl eee | ce PP 299 
51—53 res 6 9 4 oo a 1 1 31 
54—56 | 2 | | 2 
57—59 | 2 1 | 1 | | 4. | 
Totals 3 | 69 | 261 | 501 | 453 | 183 | 54 9 i | 1 | 1535 


TABLE V. Age 8-5 to 9-5 years. 


lbs. 


mn 


Weight i 


74—78 | 


| Dow & be 


TABLE VI. Age 9-5 to 10-5 years. 


Weight in Ibs. 


| 


Totals | 


3 

4 
BBD — — 1 
36—38 2, 3 — 1 | ] 8 
39—41 1 7 6 2, 16 
4244 2 14 27 23 4 70 
45—47 --- 5 54. 148 93 10 il 311 
458—50 —- 3 Se 127 281 159 35 5 — 613 
51—53 — 1 ee ad 42 130 | 108 2 7 323 
54— 56 : 10 5 30 
57—59 = 4 
60—62 2 
63—65 1 
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TABLE VII. Age 10-5 to 11-5 years. 
Weight in lbs. 
oD “Sh SH ro) om) Ss Ss ~ ~ io) ine) 
| | | | | | | | [Totals 
“ S “Hy Ss “SH > “SH S> MH SD k SH 
i) oD “SH SH io) 1D S S ~ ~ co 
ni|'oo—38 | 1 | i 
36—38 | 1 | 1 
Se ier en eee | ae 3 a ies ese pic (|, oo eee ee 5 
| 49-441 1 2 8 12 1 | | 24 
a | 45—47 | — | 3 16 65 55 10 2 — | — — — 151 
“7 | 48—50 | — I 5 40 184 161 45 ll} — -—— _ 447 
Ss Jl—53 7 49 166 148 81 12 2 _— 465 
00 | 54 —56 ] 13 42 56 455s emeS 3 168 
ssl d7— 59 1 — 2, —_— 2 1s 3 5 14 
60—62 2, . 2 
Totals >, 6 32 126 29] 352 239 15] 58 13 8 | 1278 
TABLE VIII. Age 11-5 to 12:5 years. 
Weight in lbs. 
if 56 [iso leon |wesinll peo aul teats [ines ll ecmallccn tall postal mesealmes 2 
oD “MH SH M%»> br) S ive) ~n ~ oD oD D> > I 
| | | | | | (ie hl led | | | | | Totals 
sis iVlVi sl Sl Ss six jRlSsisisigs 
: S 
wn | 86—38 
© | 39—41 
"S | 42-44 
-— | 45—47 
q | 48—50 
SS b1—53 
<q | 64—56 
= b7— 59 
60—62 
aa 63—65 
Totals 


Height in inches 


TABLE IX. Age 12-5 to 13-5 years. 


Weight in lbs. 


| | nore | 


| | 


14) 6 

50) 17 

42| 44 
heel 


Totals 


109—113 


Totals 


| 


Height in inches 


TABLE X. Age 13-5 to 14:5 years. 65 
Weight in lbs. 
] op | 0 | & | 0 | 
SRM cca xe ica loos | oe lo PS la tay la | 
[SP Ae ai | | | | | | | Lt | tj d [ Dotals 
SLCMCM roster cae der | so) So | Sos |S heyy |S 
i ™ ~™ LN 
oS ] | | | 
wo 44) | 1 | | — | | 1 
4547 | 2} 3/ 1) —|—]|— 6 
Peo 5 | 7 | 7 | 1 | 3 
§1—538{— | 2 3 16 | 31 36 wy il ak | = 98 
Pee | | | 7 | 41 | 49: | 27 | 6 | 2] 2 | 161 
Pee i | = 7 |) 19 | 28] 28.) 16 | 7 | 3: 109 
60—62 | — Peale, Poh ih AS eel Gr Eakin oe 
63—65 | — | | eon ieee aah Xe 
| | Le | 
Totals} 1) 6| 12 | 31 | 65 | 85 | 78 | 57 | 40 | 22 | 15 | 9] 6)—|—] 1] 428 
TABLE XI. Correlation Table of Heights and Weights. 
Height in inches. 
m | | 2% y > y > 2. | © | & o 9 
os on) oD “ “MH ™ my) ao) Sey |e Ae) Ss Ss | 
Br te id dele Totals | 
S oD Ss Sd XN at) me) iat SH ~~ i) ar) | 
oD a) bn) Os) “H “MH “M > ae) | wD Ss Ss 
24—28 4| 9 2 —; —]| | 16 
Poems 42629) 25 | 3) 1 | — |  — — | — a6 | 
34-38 | | 16 |220 | 414| 72] 6 | 728 
39—43 1 On OleOle e697 e195 HE tal 1476 
44-48 |— 1 7 | 122 | 875 | 603 | 38] 8 tI o— —|— 1655 | 
49—53 |— — 4 12 | 249 | 988 | 411 | 33 5 4;—|— 1706 | 
eos —o6! i) —— | | 3 1 17 | 436 | 905 | 171 1] 4 3 | — 1552 | 
| 59—63 | — | — Le | 1| 39 | 630 | 568 51 6 1 — 1297 
a 64—68 ie yal: —| 8 | 161 | 621 | 206 3 2) 2 1004 
“4 | 69—73 1, —}] — 35 | 374 | 34024 2) — 776 
= 74—78 | 3 | 106 | 335 | 76 5 = 525 
op | 79—S83 | | | | | 2] 22 | 120] 93 4} 1] 242 
“3 | 84—88 | Pie ieee h Sua 87 ea] On ing 
=| 39-93 | [Tel Osos | iS. | 66 
94—98 | | | B23) |) 8h 2 37 
99—103 | = 5 Mal | 33 19 | 
104—108 | | 1};—/; 5|] 1 7 
109—I113° | 1; — | 
114 
119 
Totals 8 | 72 | 350 | 1193 | 1914} 2178 | 2196} 1913] 1115] 361 | 69 13 | 11382 
Go a — _ —— — — | — — at 
TABLE XII. Correlation Table of Height and Age. 
Age in years 
| | | 
) 6 7 & 9 10 Il 12 15 | 14 Totals | 
30—52 2 2 3 — 1 8 
i 33—35 25 23 16 6 — 1 | 1 — — — 72 
© | 36—38 99 | 146 55 32 8 8 ] 1 — — 350 
S| 39—41 | 190 | 505 | 312 | 121 43 | 16] 5 1 et 1193 
.4 | 42—44 65 | 469 | 601 491 187 70 24 5 1 1 1914 
Sto 27 3 95 | 332 | 619 | 583 | 311 15] 54 24 6 2178 
“4 | 48—50 | — 4 31 229 | 458 | 613 | 447 | 258 134 22 2196 
eB | 01—53 | 2 31 99 | 323 | 465 | 474 | 42] 98 1913 
eal yeaah | | ep a |e 2/ 18 | 30 | 168 | 321 | 415 | 161 | 11165 
Ss | d7—d9 | 4 2 4 14 57 171 109 361 
| 60—62 | | 3 2 2 4 | 33 | 25 69 
| 63—65 | — 1} — 3 3 6 13 
| Totals [| 384 | 1244 | 1352 | 1535 | 1402 | 1379 | 1278 | 1178 | 1202 | 428 | 11382 
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TABLE XIII. Correlation Table of Age and Weight. 
Age in years 
| | | 
5 | 6 | (ie Pleat? | 9 10 | d1 | 12 | 73 | d4 \Wotals 
24—28 16 
29—33 136 
34—38 728 
389—43 1476 
44-48 1655 
49—53 1706 
| 64-58 1552 
2) 59-63 1297 
Sj 64—68 1004 
ar 69—73 776 
a 74—78 525 
ao) 79—83 242 
‘D 84—88& 138 
=) 3993 66 
94—98 37 
99—103 19 
104—108 7 
109—113 1 
} 114718 — 
119—123 1 
Totals [| 384 | 1244 | 1352 |1535 |1402 |1379 |1278 |1178 |1202 | 428 .|11382 
TABLE XV. Mean Ages of Height and Weight Arrays 
x y ey | & y Zany | © y eyy x ¥ Zyy v y Say 
-§6 -6 -4:25 8 0 -1 O -1 -—0-508516 | 2 2 2:194174 | 4 3 0:5 
-—5  —2-666667 2 -4 0 0-046409 3 2-361764 4 3-2 
Bee 185 Aap 1 0-604762 4 2531343 5 8 
-5 -6 -—4:555556 |-2 -5 -3-333333 2, 1-118012 5 2758333 6 3:5 
-5 -—4-071428 | -4 -—3-083333 3 1-542857 6 2-812500 fil 3-222222 
—-4 —3:5625 —3 —3:008608 4. 33 Hl 8 3°444444 
—3 —2-666667 — 2) —2°6222857 5 2, 8 3:666667 9 3°363636 
_9 0 | Silly Silvis 1h ea} (0) 10 3 10 4 
(0) 1 | QO -1-235294 | —-2 -—1:25 a sll sll 11 3 
Ly lg” Has | 3 -1 -0-151515 0 1-25 5 2 eal 
-5 -4-096774 |-1 -6 -3 0 0-883041 1 0:333333 5 2 
-4 -—3-927273 | -5 -3 1 1-158450 2 3 6 3 
—-3 -3:078431 Aes y 2, 1:624798 3 3 7 3 
—-2 —2-428571 | —~3 -—2-084210 3 2-048128 4 2-921052 8 4 
—-] =1:75 —-2 —1-815920 4 2:367924 5 3:021505 9 3°333333 
QO —1-666667 | -l] -—1]-292510 5 2545455 6 3:091954. 10 4 
1 0 | 0 —0-658257 6 2°75 ri 3°305555 13 4 
=$-—5 —4:04 ] 0:256410 a 33 8 3:260869 
—4 ~—3-763285 , 1 yop yy) a) | 9 3:4 
—-3 -—3-505673 | 6 3 -l -l 4 0 —0:333333 
-2 —2-786885 On 38S D2 0 0-727273 a 
=i) = 15 | =D 0:5 1 2-098039 2 1 


CORRELATION OF ANTHROPOMETRICAL 
MEASUREMENTS IN CAIRO-BORN NATIVES* 


By MYER M. ORENSTEEN 
(Ministry of Public Works, Egypt) 


1. The Material. Since 1910, owing to the completion of an anthropometrical 
study undertaken by the Egyptian Government under the direction of Mr J. I. Craig, 
now Controller-General of the Statistical Department, a series of measurements 
of about ten thousand Egyptian natives has been filed in the Survey Department. 
It has been thought of interest to take this opportunity to re-examine the 
material in order to determine the correlation between the different organs of the 
native body available. 

Referring to the nature of the characters considered in the paper, it will be 
necessary to remember here that the data belong to anthropometrical slips of 
prisoners, taken from the Cairo Anthropometrical Bureau. The slips have been 
drawn absolutely at random without any particular character of selection. 

To Mr J. I. Craig, the writer addresses his best thanks for kindly allowing him 
full access to the material, and for valuable advice. 

2. The Data. The measurements available and dealt with in the paper are: 

(L.) Length of the Head. This is the shortest distance between the glabella 
and the occipital point as measured with callipers. 

(B.) Breadth of the Head. This is the maximum breadth of the head. It is 
measured with callipers just above the back of the ears. 

(L.M.F.) Length of the left middle Finger. 

(L.F.) Length of the left Foot. 

(L.C.) Length of the left Cubit. 

(S.) Stature. This is the total height standing without boots. 

From the ten thousand individuals in hand, who are natives of all parts of 


Egypt up to Aswan, 1191 persons have been selected as being exclusively males 
born in Cairo. From this selected group a further number of 389 has been 


* As the Editor was unable to consult the author of this paper, he has considered it best to alter 
as little as possible the literary expression. 
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excluded for being, at the date when measured, youths of under 20 years of age. 
The remaining 802 individuals providing the final group of measurements dealt 
with throughout the paper are, therefore, adult male Egyptians born in Cairo. 
Of this final group 76 per cent. were measured during 1904-1905, the remaining 
24 per cent. fall within the years 1901 and 1906. 


3. Social class to which the Natives dealt with belong. From the fact that 
these natives have undergone police examination, objection may arise as to 
whether the measurements of such persons can represent the ordinary native 
body, or if really they belong to a selected group of individuals, namely the 
criminal class of natives. 


TABLE I. 


Movement of the natives measured, their distribution as regards length of imprisonment, 
and their comparison with nugration of ordinary inhabitants. 


| Place where migrated to 
Length of Imprisonment 
Lower Cairo Upper 
Egypt Town Egypt 
One year and less... ae 8-8 | 71-2 6-8 
Over one year and up to three 0-8 10-5 1-0 
Over three years Ae ae 0-0 0:8 0-1 


Total ... Soe mae 9-6 82:5 7:9 


Ordinary Egyptian inhabitants 6-3 | 91-1 2-5 


Mr J. I. Craig, writing on the same material, says* “it may be objected that 
criminality in itself is a determinating factor of selection, but the objection does 
not hold in Egypt. Here it cannot be said that there exists a definite criminal 
class, and criminals are rather amateurs than professionals. This state of things 
is in all probability due to the easy conditions under which the lower class live.” 


In support of this, I may mention some recent conclusions which Dr Charles 
Goring comes tof. “Criminals as criminals are not a physically differentiated 
class of the general community....The physical and mental constitution of both 
criminal and law-abiding persons of same age, stature, class and intelligence are 
identical....There is no such thing as an anthropological type.” 


* “Anthropometry of Modern Egyptians,’ Biometrika, July, 1911. 
+ The English Convict: A Statistical Study, by Charles Goring, M.D. 
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These statements are suggestive for our material, and some investigation has 
been thought desirable in order to ascertain how far the former is justified for the 
Cairo group when considered alone. This investigation was carried out in two 
directions : 

(1) The seriousness of the offence or crime committed. This of course is 
proportionate to the length of imprisonment. 

(2) The social class to which the natives belong. This is traced by looking 
into the daily occupation of each individual. 


TABLE II. 


Daily occupation of men measured. 


% % 
Coffee-house keeper ... 9-4 Tailor ... 2-4 
Pedlar eas ee 9-2 Cultivator 2-2 
Servant 79 Baker ... 2-0 
Driver 7:4 Butcher 1:7 
Workman 6-9 Cobbler 1:6 
Blacksmith 6-4. Barber... 1:3 
Vagabond 6-2 Washerman 1-2 
Carpenter 5:5 Coal-seller 1-1 
Painter 4:8 Water-carrier 0:8 
Cook aes 4-0 Shoe-black 0-8 
Tobacconist ... 4-0 Bricklayer 0-6 
Clerk ... 3-1 Student 0-6 
Fruiterer 2-7 Printer 0-5 
Weaver a 2-6 Milkman 0:3 
Confectioner ... 2°5 Watchman 0:3 


Table I gives in per cent. the different lengths of imprisonment. This table 
also exhibits the stability of the group measured when compared with the total 
mass of the ordinary native male population. The figures of the latter group 
were taken from the Census of Egypt in 1907, where 221,185 individuals have been 
classified as being male Egyptians born in Cairo. From Table I we observe that 
most of the men have been condemned to very short punishments. On the whole 
only nine natives have been registered for murder (manslaughter); 86-8 per cent. 
have been kept in custody for a period varying between a few days and one year; 
12-3 per cent. have been kept in for a length varying from one year and up to 
three. The remaining 0-9 per cent. have been imprisoned for a period over three 
years. This state of things is surely due to slight offences done without premedi 
tation. 


b) 


The figures under the heading “migration” show clearly that most of the 
Cairenes settle in their native town (as is also the case in the whole country). Out 
of the Cairo migrating element, ten per cent. prefer to settle down in Lower Egypt 
while eight per cent. move into Upper Egypt. We further remark that the ordinary 
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native migrates less than the group measured by the police. The natives of the 
former selection remaining in the town exceed by 10 per cent. those of the latter. 
This in all probability is to be explained by the fact that the census figures 
include children and boys under 20, up to which age there is less tendency to 
migration than afterwards. 

Table II gives a fairly exact idea of the daily occupation or social class to which 
the men measured belong. From this table we first remark that practically all 
men have their daily occupation, and that the greater part of them deal directly 
with the public. The proportion of these is 87 per cent.; the ordinary native 
element is represented by 52 per cent. Vagabonds and men without distinct 
occupation come in with 6 per cent. 

As a result of the analysis of Tables I and II, we may come to the conclusion 
that the people dealt with in this paper are not necessarily of the poorest class of 
the total mass of the Cairo natives. And although they have undergone the 
police examination they are not necessarily of a selected class of natives, nor are 
they under special conditions of health. 

4. Age of the People. As we have already said, all youths under twenty years 
of age have been excluded. Most of the natives are ignorant of their right age and 
could easily deviate by ten or twenty years from the truth though with apparent 
good faith. The age was estimated by the police officer on duty. 


The percentage frequency of men per group of age is as follows: 


Census 

From 20 to 25 years ... 52% 25 % 
58) 20) toeaOe 95,5 ee 28% 18% 

3) ol tor4Oy ee. 16% 26% 

pa Eton 70, Wey oes 4% 32% 


It will be seen that the above grouping eliminates the greater part of accidental 
errors due to personal equation of the observer. The figures given by the census 
are said to be very unreliable. 


TABLE III. 


Means, Standard-deviations and their respective Probable Errors. 


Character M+HM oF tHe 
Length of Head, in mm. ... 190-52+0-14 | 5:90-+0-10 
Breadth of Head, in mm. 144:45-+0-11 4:67+0-08 
Left Middle Finger, in mm. 114-:09+0-15 6-14+-0-10 
Left Foot, in mm. ... ; 257:64-+0-30 12-36+0-21 
Left Cubit, in mm. a5 457°51-+0-52 21-21-+40-36 
Stature, in cm. : xe 165-82-++0-14 5:93-+0-10 
Cephalic Index, in o, ae 75-82 0-07 2-91-+0-05 


* Sheppard’s corrections have been applied. 
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5. Frequency Distributions. The total number of individuals measured being 
802, the measurements were grouped so as to save time and labour. 

The grouping of the data has been arranged so as to give approximately eleven 
sub-ranges of grouping in each case with limits as follows: 


Length of the head 


every 3 millimetres 


Breadth of the head... eS. 3 
Left middle finger ore eet: 
Left foot ... Ms on ats - 
Left cubit ate bat og.) Fe 


Stature... tat bole >» 4 centimetres 


Table III gives the mean, the standard deviation corrected by Sheppard’s 
adjustments, and the corresponding probable error for each character observed. 
The cephalic index is computed as usual from the ratio of the breadth to the 
length multiplied by a hundred; this gives a measure as to whether the head is 
relatively long or broad. 

The frequency distributions were further analysed. Sheppard’s corrections 
were applied to all the moments, and the criterion « for the classification of the 
curves determined. « in all cases was less than unity; 6, and £, computed from 
fly, [l3, fla (the second, third and fourth moments) were practically 0 and 3 re- 
spectively for all distributions. 

The normal curve of the form Y = y,e~“/2” was therefore chosen. 

The theoretical frequencies for every sub-group of each distribution of measure- 
ments have been computed from the following formulae: 


Length of the head: 
Breadth of the head: 
Left middle finger: 


= 162-73e—2/69°61382, 
_ 2 142°6194 
= 205-50 ¢-2 /43'6194, 


= 908-43 (FAC OCELE 


Left foot: = 181-20 ¢-2#?/305°5150 
Left cubit: = 226-30 e-27/899°3250 
Stature: f = 215-77 e-27/70'3072, 


where f is the frequency for every unit of grouping, 
vis the distance from the origin (in these cases the origin and the mean 
coincide) to the foot of the ordinate to be calculated, 
e is the base of the Napierian system of logarithms. 

The comparison between the observed frequencies and the theoretical fre- 
quencies computed from the above formulae is given in Table IV. x? is calculated 
from the sum of the squares of the departures of the theoretical values from the 
observed ones, divided by the corresponding theoretical values. This is shown 
by the formula 

X= 2[Yo — Yeh l¥cl- 
P is the probability that, in another random sample of the same size, a worse 
fit would be obtained. 
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TABLE IV. 
(a) Length of Head (6) Breadth of Head 
Centre | Observed | Computed es | Centre Observed | Computed A 
of Group | frequency | frequency | 9 _¢ of Group | frequency | frequency 0-C 

mm. O Cc mm. O Cc 
174 4 3 Pa ly 131 = 6 3} + 3 
177 8 12 - 4) 134 8 17 = 59 
180 41 33 + .8 137 75 58 + 17 
183 | 69 72 a ees 140 121 130 a) 
186 | 114 122 - 8 143 190 196 - 6 
189 177 158 + 19 146 202 195 + 7 
192 160 158 + 2] 149 125 128 - 3 
195 98 122 - 24 152 55 56 I 
198 75 73 eee al 155 ll 16 - 6 
201 3) 34 + 5 158 8 3 + 3 
204 a 12 ah 161 1 0 fioce AD 
207 10 3 ee AT, 
x2 =3A7/C =30-3 P=-0015 X2=2A2/Cl=19:2 P =-0378 


(c) Left Middle Finger 
Centre Observed | Computed ee 
of Group | frequency | frequency 0-C 
mm. | O Cc 
94-5 | 1 1 0 
98-5 6 8 - 2 
102-5 | 32 35 3 
106-5 lll of 14 
110-5 176 176 0 
L435 | 215 208 7 
118-5 137 161 - 24 
122-5 80 82 = 2 
126-5 36 27 + 9 
130-5 | sy) 6 al 
134-5 3 ] + 2 
Vo > Ae Ce 9:8 P =-4590 
(e) Left Cubit 
| Centre Observed | Computed nee 
| of Group | frequency | frequency 0-0 
mm. 
395 | 3 4 - Jd 
410 20 18 ap 
425 72 70 en) 
440 160 161 - 1d 
455 224 225 - 1 
470 186 190 - 4 
485 100 97 + 3 
500 26 30 - 4 
515 10 6 + 4 
530 0 ] =siicrl 
545 | 1 0 Bi pent 
2 =3A7/C =3-9 P=-9489 


(d) Left Foot 


Centre | Observed | Computed ees 
_ of Group | frequency | frequency | g_g 
| mm. | O C 
225 5 6 =e el 
232 19 21 = 2 
239 70 58 + 12 
246 111 117 nO 
253 178 169 - Y 
260 167 178 - Ill 
267 122 136 - 14 
274 88 76 + 12 
281 28 30 - 2 
288 ies | 9 + 2 
295 3 2 + 1 
| x7 =35A2/C =8-7 P=:5677 
| 
(f) Stature 
Centre Observed | Computed NE 
of Group | frequency | frequency | ¢ _¢ 
cm. O C 
149-5 4 5 - 1 
153-5 28 25 aS 
157-5 73 81 - 8 
161-5 185 166 + 19 
165-5 212 215 - 3 
169-5 167 178 - dil 
173-5 87 93 - 6 
177-5 31 31 0 
181-5 a 7 + OY 
185-5 1 1 0 
en (OC) alliloy/ P =-2341 


Frequency per thousand individuals 
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Fic. 1. Length of Head. (y=202-90x 2°71828~27/69°6132) 


220 oO, 
200 
180 
160 
140 
120 
100 


ua) 
= 
oS 

i 

§ 
cS 


Mode,mean and median 


174.1 177 


o> _ 
T T T — — = iF == 
180 | 183 | 186 | 189 | 192 | 195 | 198 | 201! 204 | 207 


Measurements in millimetres 


Fia. 2. Breadth of Head. (y=256°23 x 2°71828~#7/43°6194) 
260 
240 
220 
200 
180 
160 
140 
120 
100 
80 


60 
40 


20 
10 


Mode, mean and median 


Quartile 


Frequency per thousand individuals 
Quartile 


131 134 137 140 143 146 | 149 152 155 168 


Measurements in millimetres 


Fic. 3. Left Middle Finger. (y=259°89x 2°71828 —17/75°3052) 
280 
2 
260 . 
240 
220 
200 
180 
160 
140 


120 
100 
80 
60 
40 


20 
10 
ie) 


Mode, meanand median 
Quartile 


Frequency per thousand individuals 


94-5 98-5 ' 102-5 !106°5 ! 110-5 '114°5 ! 118-5 '122-5 ' 1265 | 130-5 '134- 
Measurements in millimetres 


74 


Frequency per thousand individuals 
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Frequency per thousand individuals 
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Fie. 4. Left Foot. (y=225-93x 2°71828~ 27/305'515) 


Quartile 


225 232 239 |' 246 253 260 267 274 281 288 295 


Measurements in millimetres 


Fie. 5. Left Cubit. (y=282°17x 2°71828~27/899"2250) 
280 
260 
240 
220 
200 
180 
160 
140 
120 


100 


Mode, mean and median 
Quartile 


425 440 455 470 
Measurements in millimetres 


Fic. 6. Stature. (y=269-04x 2°71828 ~*7/70'3872) 


Quartile 
Mode, mean and median 


149°5 § 163°5 ' 157-5 ' 161-5 ©1655 169°:5 © 178:5 © 177:5 °181°5 © 185°5 


Measurements in centimetres 
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Figures 1 to 6 give the calculated frequency curves. The lengths of the 
ordinates represent the frequency per thousand of observations in a corresponding 
group where the ordinate considered is the mean for the unit of grouping. The 
dotted curves have been drawn from the observed frequencies reduced to the 
same base. A glance at these figures is sufficient to show that normal curves fit 
pretty well our statistics. The quartiles have been drawn with the aid of an 
integraph. 


6. Coefficients of Correlation. With the same grouping of the data I have 
constructed correlation tables. These are given in Tables VIII to XIV. 


The coefficient of correlation for the length and the breadth of the head was 
found to be r = + 0:244+ 0-020. This coefficient is as we should have expected 
lower than the results from the remaining pairs of characters. Attention may be 
drawn to the fact that this particular coefficient varies very much from race to 
race. A comparison with similar results for different races may be of interest here: 


3 

Smith Sound Eskimos ine y= +0°-47+0-08 
Ainos oPe r= +0-43-+40-06 
Naqada Race ... r= +0:34-40-05 
Germans r= +0-:29-+0-06 
Bavarian Peasants r= +0-28-+0-06 
Cairo Natives ... r= +0-24+0-02 
French Peasants — r= +0:13-+0-09 
British Columbian Indians ... r= +0-08+? 


Taking into consideration the great variation of this coefficient from race to 
race, the writer is greatly inclined to believe that a correlation between the length 
and the breadth of the head really exists, but that its variation in value is very 
probably due to special factors, probably working independently and tending to 
modify the real measure of relationship. This is being further investigated, and 
a closer study of such factors is being undertaken. 


The results of the computation of Tables VIII to XIV are grouped in Table V. 
From this table we may deduce the following conclusions: Were all human bodies 
perfectly similar all the coefficients of correlation between the different organs 
would be unity; if on the contrary there should exist a complete independency 
of the organs, the coefficients would be all nil. We may therefore conclude from 
Table V that we are to expect a tall man to have more often long feet than long 
fingers and short men will more often have long fingers than long arms. Further, 
men with long arms will very rarely have short fingers, and such people will more 
often be men with greater stature than men with long feet, and so on. 


Table VI gives the probable error for each coefficient of correlation shown in 
Table V. They are in all cases very small, which proves our results to be very 
reliable. 
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The results shown in Table V have been compared with the like data for 
Kuropean countries, and no serious deviation in the coefficients has been noticed. 
This tends to confirm the statement made above for the head characters. 


TABLE V. 


Coefficients of Correlation. 


Character L.M.F. LF. LC. S. 
| L.M.F. 1-000 0-763 0-832 0-661 
| LF. 0-763 1-000 0-757 0-715 
| LC. 0-832 0-757 1-000 0-803 
S. 0-661 ae 0-803 1-000 
TABLE VI. 


Probable Errors of Coefficients of Correlation. 


7. Regression Equations. Considering the significancy of all the coefficients of 
correlation shown above, we may Conclude that the variability between the human 
organs is not chaotic, but governed by very definite laws. We are therefore able 
to predict an unknown measurement in terms of the given length of a known 
organ. 


Table VII results from this investigation. This table shows us how and with 
what accuracy human characters can be calculated one from another. The 
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TABLE VII. 


Table for Application of linear Regression Equations. 


Computed from 


Character | Constants | | 
LM.F. | LF. LC. Ss 
in mm. in mm. in mm. in mm. 
m — 0-3790 0:2409 0-0684 
DMD ¢ = +16-4 43-9 40-7 
in mm. | 
p.e = = Ori 12-3 +3-1 
m 1-5359 — | 0-44.11 0-1490 
n | 
= = =a = = = | See aes |S == = 
elt. ae ¢ +824 = + 55-8 +106 
a in mm. 
= a EE Se ee _ = iene ees 
S 
nR 
cs p.e ae 5:4 = | SI lays) Ss 5:8 
q | 
8 m 2-8741 | 1-2990 = 0-2872 
ey | 
5 = | a 
2 LC. . | = 
3 nee c +129-6 + 122-8 -- — 18-7 
m 6:3839 3:4304 2-:2451 | - 
S. | = ‘ 
eM @ +929-9 +774-4 +631-0 — 
| p.e. +29-5 +27°5 ac 23°5 | —— 
| 


regression equations given in this table are of course linear, and are to be put 
into the form 


Y=ML+C, 
where y is the measurement looked for, 
x is the known measurement, 


m is the change of the dependent measurement for a unit change of the 
independent variable, 
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p.e. is the probable error for each calculated measurement, and is computed 
from 


0°67450yn® (1 — 72)? /(n — 2)3, 
and ¢ is a constant. 
All characters, stature included, are in millimetres. 


Besides the formulae given above, the following multiple regression equations 
for the determination of the stature may be of interest: 


Reconstruction of stature from a knowledge of one measurement: 
Stature = 6-3839 (Finger) + 929:9, 
Stature = 3-4304 (Foot) + 774-4, 
Stature = 2-2451 (Cubit) + 631-0. 
Reconstruction of stature from a knowledge of two measurements: 
Stature = — 0-2228 (Finger) + 2-2981 (Cubit) + 632-2, 
Stature =  2-6699 (Finger) + 2:4195 (Foot) + 730-2, 
Stature = 1-7136 (Foot) + 1-2034 (Cubit) + 564-2. 
Reconstruction of stature from a knowledge of three measurements: 
Stature = 1-4309 (Foot) + 1-9866 (Cubit) — 10166 (Finger) + 496-6, 


where all the values, stature included, are in millimetres. 


8. Summary of Conclusions. 


(a) The raw material, though taken from the anthropometrical slips of 
prisoners, is a fair sample of the ordinary native community. 

(b) The natives dealt with are not of the poorest nor of the lowest class of the 
Cairo natives. 

(c) The records are homogeneous, and are sufficient in number for a good 
statistical study. 

(d) The distributions of stature, cubit, finger, etc., are normal. 

(ec) The coefficients of correlation between all organs are significant, no matter 
to what race the records belong. 

(f) The variability between the human organs is not chaotic, but governed 
by definite laws which enable us to predict measurements one from another. 

(g) The coefficients of correlation for the human organs are practically identical 
from one race to another, only the coefficients resulting from the head measure- 
ments vary to any great extent. 

(hk) The great variation which the coefficient of correlation for the head 
measurements undergoes from race to race, is to be traced to special factors not 
yet determined, which influence this measure of relationship. I hope to in- 
vestigate this further. 
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TABLE VIII. Correlation Table of Length and Breadth of Head. 
Length of Head in mm. 
iron) ie) ™ i es ~~ | S Op) i © S Ny ”» 2) 
~ ~ ie) ee) oe) S SD lon Q S S S 
~N ~s ™ ™~ ™ ~ ™~ ™ ~ XQ fon] nN 
peel [| Total 
5 wD ee) ~ oS | 
~”n ao S S | 
~ ™~ | 
g 
|| 130—132 
= | 183—135 
“= | 136—138 
a2 139—141 8 4 4 3} 120 
® | 142—144 | 24 19 8 ] 2, 190 
Mo) 145—147 | 29 12 rial 1 1 | 202 
2 148—150 13 18 8 1 2, 125 
151—153 13 7 5 1 4 55 
| 154—156 2 2 1 = 1 12 
"S| 157—159 }-1 | 4]71]/—|=+—] 8 
© | 160—162 | ieee l 
faa 
| Total | 802 
fame 2 Bet | alee aa a} (een 
TABLE IX. Correlation Table of Left Foot with Left Middle Finger. 
Left Foot in mm. 
9 ron) NQ S Ss ia) = ~ “Hy ™~ —<) 
nN in) MH MH ie) ea) ~ ~ So) Sd S 
i) NX NX NX N YN ies) iow) ley) QQ YN | 
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TABLE X. Correlation Table of Left Cubit with Left Middle Finger. 


Left Cubit in mm. 
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Left Cubit in mm. 
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Measurements of Cairo-born Egyptians 


TABLE XI. 


Correlation Table of Stature with Left Middle Finger. 


Stature in cm. 
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388—402 
403—417 
418—432 
435—447 
448462 
463—477 
478—492 
493—507 
508—522 
623—537 
538—552 


TABLE XIU. 
Correlation Table of Left Foot with Left Cubit. 


Left Foot in mm. 


Total 


Left Cubit in mm. 


Left Foot in mm. 
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TABLE XIII. 
Correlation Table of Stature with Left Foot. 
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A STUDY OF THE CRANIA OF THE MORIORI, OR 
ABORIGINES OF THE CHATHAM ISLANDS, NOW IN 
THE MUSEUM OF THE ROYAL COLLEGE OF SURGEONS. 


By EVELINE Y. THOMSON, Crewdson-Benington Student in Craniometry, 
Biometric Laboratory, University College, London. 


1. Introductory. The fact that pure bred Morioris are now very rare if not 
extinct renders of special value any study of such crania of them as still exist. In 
papers referred to below Dr Scott* deals with 50 and Dr Duckworth} with 10 
Moriori crania. Professor Thane possesses two specimens and there are a few 
scattered through various British and Continental museums. Hence the 63 crania 
in the museum of the Royal College of Surgeons almost double the available number. 
Yet if we pool the present measurements with such of Dr Scott’s series as are homo- 
geneous, our results are still too slender to warrant the publication of more than 
the measurements themselves and their mean values. As one of the best uses of 
methods of treatment which do not extend beyond the mean the type sections of the 
Moriori crania have been reconstructed. By most kind permission of Professor Arthur 
Keith I have been able to study the collection of Moriori skulls in his charge, and 
he most readily provided facilities for the photographing of a considerable number 
of the crania. Although the collection has been for a considerable time in the 
Royal College of Surgeons no complete set of measurements appears to have been 
published {~. Dr Duckworth in the paper just cited makes some remarks on a 
few of the abnormal skulls. Forty-four crania were purchased by the Royal 
College as “from the Chatham Islands” in 1892, i.e. 765! to 76544, six, 765% to 
765°°, appear to be later additions, while thirteen, 758-760, 7601, 760?, 761-765, seem 
to have an earlier origin, as well as the three 1015, 10154 and 1016 from the Barnard 
Davis collection. The very remarkable character of the Moriori skull is not fully 
accounted for by the outlying character of their island, although anthropologists 
naturally look to such islands for the survival of primitive types. Chatham 


* Contribution to the Osteology of the Aborigines of New Zealand and of the Chatham Islands. 
Transactions of New Zealand Institute, Vol. xxvi. pp. 1—64. Wellington, 1894. 

{ On a collection of Crania of the Moriori, Journal of the Anthropological Institute, Vol. xxx. p. 141 
et seq. London, 1900. 

{ Hight have certain measurements given in Flower’s Catalogue of the Royal College of Surgeons. 
Barnard Davis gives measurements of three others in his Thesaurus Oraniorum. 
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Island lies to the east of New Zealand. According to their own traditions the 
Moriori reached their present home direct from Hawaiiki, so that they are not 
degraded Maoris but form a special branch of the Polynesian race. Never numerous, 
they were mostly slaughtered in the year 1838—two years previous to the first 
British colonisation of New Zealand—by a party of Ngatilawa landed on the 
island by a whaler. This party enslaved the few survivors. Some years ago the 
remaining Moriori were stated to be reduced to fifty, and these were probably not of 
pure race*. There was nothing of special note in their occupations, habits or beliefs. 
Most of their life was utilised in procuring food, making garments and sleeping 
mats, or in shaping stone implements in which they showed considerable ingenuity. 
Fish hooks, fish spears and gaffs in constant use were cleverly shaped out of bone. 
The dwellings were ordinary conical huts constructed out of fern tree and thatched 
with toi-toi grass; they were often sufficiently large to accommodate twenty or 
thirty persons. Some of them were ornamented with rude carvings, but most 
of these places have been demolished by the Maori, or perished from age, so that 
no specimens of their art are now obtainable. They lived peaceably, chiefly along 
the coast, spending their time as a rule in procuring supplies of fish, fern roots, 
nuts, wild duck, ete. Periodically they put off in rafts or large canoes for the 
hunting of seal and albatross. They are said to have had a confused notion of 
good and evil spirits and to have invoked numerous deities prior to any under- 
taking of importancet. In short, such accounts as we have of them do not 
seem to differentiate them markedly from the inhabitants of many other Poly- 
nesian islands. They stand out differentiated rather by their cranial than by 
their cultural characters. 


2. Measurements and Methods of Measurement. The measurements I have 
taken are similar to those adopted by previous biometric craniologists. They 
were made in the manner described by Fawcett (Biometrika, Vol. 1. p. 416) and 
Macdonell (Ibid. Vol. 111. p. 200). The measurements are as follows: 


C = capacity taken with mustard seed, packing in skull and in measuring glass, 
not by our usual laboratory method of weighing as no fine chemical balance was 
available, the work being done away from the Laboratory. In the tables it will 
be seen that I have given two figures for each capacity. The seed was tightly 
packed into the skull and then tightly packed into the measuring glass. This is 
the standard method. It involves the great labour of double packing; the second 
packing is avoided in our laboratory use of the balance. It occurred to me to be 
worth while trying loose packing in the measuring glass and correlating the results 
for tight packing and loose packing with the view of saving the time employed 
in the second tight packing on another occasion. The loose packing consisted in 
taking the seed from the skull, gently pouring it into the measuring glass and 
then giving a slight shake sufficient to obtain a level surface for reading. The 


* R. Lydekker, Living Races of Mankind, Vol. i. p. 72. 
+ Extracted from Notes on the Chatham Islands by J. W. Williams, of Waitangi West, Chatham 
Island. Journal of the Anthropological Institute, Vol. xxv. p. 343. London, 1897. 
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results are given in the footnote below*; it suffices to remark here that for practical 
purposes it appears approximate enough if we deduct one-twentieth from the 
volume of the loosely packed seed in order to obtain the volume of the tightly 
packed. My corrected results from loose packing are given in brackets in the 
tables. The means are: 
Tight packing, 3 1422-07; Loose packing, ¢ 1423-03, 
- 5 2 1310-73; . * 2 1308-47. 

The mean difference (regardless of sign) of the results obtained from loose 
packing from those obtained from tight packing is, however, 8cm.3 How much of 
this is due to variation in the tight packing in the measuring glass, how much to 
fortuitous settlements in the loose packing it is not possible to say. The differences 
suggest at any rate that a method may well be devised by which tight packing 
in the measuring cylinder may be avoided. 

Further measurements made are: : 

F = Flower’s Ophryo-occipital length. ZL’ = glabellar horizontal length. 
£=maximum length from glabella to occiput. B= maximum horizontal 
breadth. B’ = least forehead breadth from one temporal crest to the other. 
H = Basio-bregmatic height. OH = auricular height as measured on the 


* Let C, be the capacity from tight packing, C, from loose packing; let « be a standard deviation, 
V a coefficient of variation and 7, the correlation of C,; and C,, then in cubic centimetres: 


For male crania. For female crama. 
Mean C;= 1422-07 Mean C,= 1498-11 Mean C,= 1310-74 Mean (,= 1377-37 
o, = 110-185 o, = 113-263 o,= 90-265 o, = 90-122 
Vii ae Vi= 17:56 V,= 6-89 V,= 6-81 
rq = 9935 ry = 9969 


It will be seen that the correlations are very high. The regression lines giving C, for C, are: 
C, = :96654 C, — 25-91 for males, 
CO, = -99847 C, — 64:53 for females. 
We should have expected the same line to have resulted from the two series, as it does not it is clear 
that the coefficient of C, is a function of C,;. To determine the nature of loose packing a little further 
we took the regression lines of (C, — C;)/C, and C, or equations of the form (OC; — C;)/C;= AC, + B. 


There resulted: 
(C, - C,)/C, = 06843 — -0000,1173 C, for males, 


(C, — C;)/C, = -056939 — -0000,0619 C, for females. 
These may be written: 
C, = C;, (-93157 + -0000,1173 C,) for males, 


C,= C, (-94306 + -0000,0619 C;) for females. 
The terms depending on C;? are so small that we may insert in them for C,, the mean values in each 


case, when we find: 
C, = C, (-94914) for males, 


C, = C, (:95159) for females. 
The differences are now not unreasonable considering the paucity of the data, and we see that very 
approximately it suffices to take for both sexes, C, = C, (1 — zy), the rule given above. 

The whole matter is to be reconsidered on more ample material. It is not clear at present how 
far it depends on personal equation in tight and loose packing of different amounts and how far on 
the diameter of the measuring glass, i.e. the pressure varying with height or with C’, and so affecting 
the looseness of the loose packing. 
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craniophor, that is the height perpendicular to the horizontal plane above the line 
joining the upper points of the margins of the auricular orifices. LB = length 
from basion to nasion. ZP = profile angle, measured with goniometer, the 
skull being on the craniophor. Q = transverse arc from upper margin of one 
auricular passage to upper margin of the other, the arc being taken perpendicular 
to the horizontal plane with the skull on the craniophor. This perpendicular 
are goes generally behind the bregma through a point, the “apex,” which is 
marked on the skull by the sliding bar of the craniophor, i.e. the bar by which 
OH is obtained. Q’ is the same measure taken when the skull is off the craniophor*. 
S = sagittal arc from nasion to opisthion. S,= are from nasion to bregma, 
S, = are from bregma to lambda, S; = arc from lambda to opisthion. 8S’; = chord 
from lambda to opisthion. U = horizontal circumference measured directly above 
the superciliary ridges and round the most projecting part of occiput. G’H = upper 
face height from nasion to alveolar point. GB = face breadth from the lower 
end of one zygomatic-maxillary suture to that of the other. J = zygomatic 
breadth, from the most lateral point of one zygomatic arch to that of the other. 
NH = nasal height from nasion to the lowest edge of left pyriform aperture (in 
cases where this is damaged the lowest edge of right aperture is taken; if there 
be a marked difference between the heights of the two apertures, both heights 
are given). NB= nasal breadth, greatest breadth of nasal aperture, wherever 
it may be. O,= breadth of orbit, R and L, the greatest breadth from side to 
sidef. O,= height of orbit, R and L, taken perpendicular to O,. G, = length 
of palate, from the point of the spina nasalis posterior to an imaginary surface 
tangential to the inner alveolar surfaces of the mesial incisors. 6’, = length 
from base of spine. G,—= breadth of palate between the inner alveolar walls at 
the second molars. GL = profile, or basi-alveolar length, from basion to alveolar 
point. fml = length of foramen magnum; fmb= greatest breadth of foramen 
magnum. (GH = face height. 

Weight of skull in grams. In this series the actual weight of the skull (measured 
without mandible) was taken, as the crania being so abnormally heavy, it was 
thought the individual weights of the skulls might be of some interest. 

The indices calculated are: the cephalic indices 100 B/L’, 100 H/L’, 100 B/L, 
100 H/L and 100 B/H; the upper facial index 100 G’H/GB; the nasal index 
100 NB/NH,; the right and left orbital indices 100 O,/0,; the palate index 


100 G,/G,: the occipital index Oc. I. = 100 e 54 gt s) ; the foraminal 
index 100 ae . Finally the gnathic index has been obtained from mean values. 


fml 


* Q is taken with the tape over the skull and measured from the top of one ear rod to the top of the 
other; it is somewhat troublesome to take. Q’ is easier, but owing to the recessing of the margins 
of the auricular passages is not quite the same as Q. The differences in Q and Q’ are partly due to this, 
partly to observational error. The mean values: Q = 316-11 for gs, = 310-00 for Qs, and Q’ = 315-94 
for js. = 310-20 for Qs, show that there is close concordance of both methods in determining type. 

+ On the difficulties of measurement of O,: see Biometrika, Vol. mt. p. 201. 
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The twelve mandibles available were measured: w,= condylar width, or 
greatest width from outside one condyle to outside of second; w, = greatest width 
at angles from outside one angle to outside of the other; fh, = greatest height, 
from lowest median projection to top of alveolar margin between mesial incisors ; 
and zz= distance between foramina mentalia. The mandibles were too few in 
number to justify a more elaborate system of measurements. Also the following 
angles were taken in the manner described by Cicely Fawcett*: (a) profile angle 
(ZP) as already mentioned measured with Ranke’s goniometer; (b) angle between 
upper face length and profile length, the alveolar angle (7A), determined from 
the length measurements GL, G’H and LB by aid of Pearson’s trigonometer; 
(c) angle between skull basis, and upper face length, the nasal angle (ZN), deter- 
mined as in (6); (d) angle between skull basis and profile length, the basilar angle 
(ZB), determined as in (6); (e) angle between skull basis and the standard horizontal 
plane (6,), the basio-nasal horizontal angle, determined as in (b) with the use of 
profile angle as found by goniometer. (f) Angle between profile length and the 
standard horizontal plane (@,), the basio-alveolar horizontal angle, determined 
as in (e). 

Sex. I first recorded my own opinion as to the sex of the skulls, and then 
Professor Pearson gave me his judgment on them. In one or two cases, where 
it was most difficult to determine the sex, and we could not agree, appeal was 
made to Professor Keith, and he kindly gave his opinion also. 


3. Contours. A study of the three chief contours of these crania was made. 
These contours were drawn by aid of a Klaatsch contour tracer, in the same manner 
as the contours traced by Dr Crewdson Benington in his paper in Biometrika, 
Vol. vit. p. 123, on Cranial Type Contours. From these contours were obtained 
the average or type contours of the Moriori crania. The construction of the type 
contours was as follows: 


(a) The transverse, vertical, or auricular coronal section. The skull being 
placed on the craniophor, and adjusted in the usual manner to the standard 
horizontal plane, the horizontal rod of the craniophor determines the “apex” 
of the skull. The plane through the apex of the skull (which is marked with a 
pencil stroke) and the “auricular points” is the transverse vertical plane of the 
skull. The skull being now transferred to the Klaatsch apparatus, the vertical 
plane is made horizontal and the contour traced from “auricular point” to “auri- 
cular point” through the apex. The line joining auricular point to auricular 
point is the auricular line of the contour, and the line perpendicular to it through 
its mid-point is the vertical axis MA of the transverse vertical contour. This 
vertical axis is divided into ten equal parts by aid of the proportional compasses, 
and lines parallel to the auricular line are drawn through the points of division ; 
these lines will not necessarily be bisected by the vertical axis owing to the asymmetry 
of the skull. They are numbered from below upwards 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 


* Biometrika, Vol. 1. p. 418. 
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where | represents the lengths on the auricular axis. An eleventh line is taken 
one-quarter of the last section from the apex, the distances to R and L of the 
vertical axis up to the contour are measured off along these eleven horizontals. 
Also the length of the vertical axis MA is measured. Since the skull is placed 
with the occiput downwards on the Klaatsch, the right side of the skull is to 
the observer’s left when viewing the contour, and the left of the skull to his right. 
This is reversed in plotting type-contours in order to obtain the norma occipitalis*. 

The following measurements are now entered in the columns of the record for 
each contour—the sum of 1R and 1Z being the interauricular diameter : 


| 
M.A. lk | 1L 
| 


| 
| TR ee BR) Sia) OR | OL | alee A}R | ALL 


me 


The means of the columns give the corresponding lengths for the type skull 
and from these we can plot the points on the type section; this section is then 
drawn between the points by aid of a spline, and we thus reach the transverse 
type contour. See Figs. III and IV for male and female respectively. 


(6) The glabellar horizontal section. This is obtained by placing the skull on 
the craniophor, and by aid of the scriber marking points on the same horizontal 
plane as the nasion and the gamma, in the region above both auricular passages. 
The skull is now adjusted in the Klaatsch holder, apex downwards, until these 
four points are shown to be in the same horizontal plane. The horizontal contour 
is then drawn parallel to this plane but through the glabella; it will fall below the 
lambda, and does not pass far from the occipital point. The points in the median 
plane immediately above the nasion and below the lambda on this contour are 
especially marked with the Klaatsch tracer. They will be spoken of as F and O, 
the glabellar and occipital points. FO is the fronto-occipital line of the horizontal 
section. This fronto-occipital line being found, it is divided into ten equal parts 
by the proportional compasses, and lines perpendicular to the fronto-occipital 
line are drawn through the points of division, and the lengths up to the contour 
measured right and left. To obtain the frontal and occipital curvatures, parallel 
lines at F} and 0} were taken as in the other cases. 

In the case of the Moriori contour there were such sharp projections at the 
linea temporalis that ordinates perpendicular to the fronto-occipital line both 
right and left were drawn to the most projecting points. These are the ordinates 


* Benington (Biometrika, Vol. vim. p. 151 etc.) gives the transverse section as norma facialis, and 
his tracings must be reversed in superposition on mine. 
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T, and T; with abscissae zp and z,. To complete this contour in the neighbour- 
hood of the glabella extra lines were measured at quarter and half the distance 
from F to 2; these lines are spoken of as the F} and F} lines. All these points 
being recorded they were found sufficient by aid of a spline to reconstruct the 
horizontal section effectively. 


When we look at any horizontal contour with the frontal point towards us, 
the right of the skull is on our right, as it would be if we were facing the actual 
skull the right way up. The ordinates are reversed in plotting the type contour in 
order to obtain the norma verticalis. 


We have the following measurements entered in our recording book: 


FO | FER run ran FAL| 2R | 2b | 3R | 3L | 4k | 4L 


Bebe goo" 


TR | 7L | 8k | 8L | 9R | OL | 10R 


BR | 6R om 


| 


BReee) 


The means are again found for all the columns. The results are plotted, splined 
in, and we obtain the type horizontal section for g. and 9. See Figs. V and VI. 


cL 


(c) The sagittal or median section. The skull being again placed on the cranio- 
phor and adjusted to the Frankfurt horizontal plane, the point y on the same 
horizontal level as the nasion is marked off on the median plane of the skull by 
means of the scriber. The skull is now removed to the Klaatsch, and adjusted 
so that the nasion, bregma and lambda lie in the same horizontal plane. The 
contour is then drawn, starting from the alveolar point. The alveolar point, the 
nasion, the bregma, the apex, the vertex, the lambda, the gamma, the inion, the 
opisthion, the basion, and the auricular point are all marked on this contour drawing. 
We now take the horizontal line through the nasion, i.e. the line linking the nasion 
to the gamma, as our “horizontal axis” and divide it into ten equal pieces by aid 
of the proportional compasses. The nasion counting as 0, and the other divisions 
from nasion to gamma as l, 2, 3, 4...9, ordinates are drawn through these ten 
points perpendicular to the horizontal axis. Lines 8 and 9 are produced, and 
continued on the other side of the horizontal axis; a further upright is taken near 
the gamma at } the distance from the gamma to 9; this for brevity is termed the 
yi line; this line is also produced and continued below the horizontal axis. At 
0, i.e. vertically through the nasion, there will in most male skulls be a sensible 
ordinate roughly of about 20mm. _ In order to get the curvature in the neighbour- 
hood of the glabella a vertical tangent was drawn by aid of the set-square to the 
most projecting part of the glabella; the point in which the tangent meets the 
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prolonged gamma-nasion line being 7 and the point of contact of the tangent 
being G, the ordinate GT and the abscissa TN represented by the letters G, and 
Gz respectively were measured, they are practically the coordinates of the glabella. 
To complete the drawing of the frontal part of the sagittal contour, an extra 
ordinate was measured at } the distance from 0 to 1; this line is spoken of as the 
Ni line. Perpendiculars were also dropped from the opisthion, auricular point 
and suborbital point to Ny. Also a perpendicular from the most projecting point 
of the contour just above the inion was drawn to Ny. These points will now 
be found amply sufficient to reproduce the individual contour as closely as it could 
be redrawn by the Klaatsch. 

In our record book, we have accordingly the following entries, as well as the 
number and sex of the cranium: 


Ordinates above Ny 
B-sub. | B-sub. | V-sub. _ _ 
BYE Monn | toGl | toar | om | | 
O lve | 23 | 4156 | 6 7 | 8°] 9 | 42) 48 
| 
| | | q | 
| 
| — = 
Ordinates  |- ‘ are Me 
below Ny Oce. r Suborb. Aur. Op. Lengths Angles 
BB laalotl el ala) y Pal y eM y aa y |GL|NT) Be | Bae |! | Bar| Mews 4 4(¥ 4| Be | Pz 


The type contours are constructed as before. See Figs. I and II. 


(d) General remarks on Contours. Illustrations of the type contour of the 
Moriori are shown in Figs. I to VI. For the sagittal contour: 

N = Nasion, Gl. = Glabellar point, B = basion, B = Bregma, V = vertex, 
A=Lambda, y=Gamma, I = Inion, Op.= Opisthion, Aur. = Auricular point, 
Bas. = Basion, Alv.= Alveolar point, Suborb. = Infra-orbital point. Swuborb. to 
Aur. if joined by a line gives the standard horizontal plane. Ny is the horizontal 
plane through the nasion. The angles N, B and A are the nasal, basilar and 
alveolar angles of N B Alv. the fundamental triangle. P= the profile angle measured 
from the Frankfurt horizontal, ¢ =the angle between the glabella-inion and 
glabella-bregma lines, 4’ = the angle between the nasion-inion and nasion-bregma 
lines. There are three interesting subtenses, $s, wand y. y is the maximum subtense 
or height of the calvaria measured from the glabella-inion line, sis the corresponding 
bregma subtense and «x is the bregma subtense to nasio-lambda line. 


The most marked characteristics of the Moriori skull are the depressed 
and retreating forehead, the bulging glabella, and the high sagittal crest. On 
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looking at Figs. I and II, it will be seen that these points are well demonstrated 
in the type sagittal contours. The glabella is not of course so pronounced in 
the female contour, but it is pronounced for a female skull. Figs. III and IV, 
illustrating the transverse contours of these skulls, show nothing particularly 
characteristic; the sagittal crest can be seen, but it is more marked further 
back than the transverse section. Figs. V and VI, showing the horizontal 
contours, demonstrate very markedly that the Moriori skull is of a low type. 
There is a very substantial difference between the type horizontal contour of the 
Moriori skull and that of the English skull illustrated in Dr Crewdson Benington’s 
paper on Cranial Type Contours: see Biometrika, Vol. vii. p. 123. It may also 
be noted that in these Moriori horizontal contours, there is a decided flattening 
at the occipital. 


Before we discuss the quantitative value of these contours it is well to examine 
the arithmetical results obtained for the measured characters. 


TABLE I. Mean Values. 


Character 3} 2) Character ey je) 
Garnet 1422-07 1310-73 Gy | 53-12 50-46 
ae (1423-03) * (1308-47) * Gy 38°85 37-10 
Weight 764-27 622-23 GL 100-93 97-36 
F 182-00 176-09 fml | 35-29 34-41 | 

L’ 185-64 177-77 fimb | 30-78 30°15 
L 186-87 178-60 +H 121-8 108-57 

B 141-40 | 138-41 Wy 123-6 114-0 
B’ 95°31 94-01 Wo | 106-4 101-87 
H 135-91 132-81 hy 33°17 30:27 

OH 117-05 115-08 zz 51:48 48-5 
LB 105-60 102-31 Oc l 62-17 64-15 
Q 316-11 310-00 B/L’ 76-51 77:90 
an 315-94 310-20 A/L’ 73°23 74:76 
S 368:°77 354-40 B/L 76-07 77-52 
S, 126-03 121-39 A/L 72-78 74-52 
S, 122-40 | 117-36 B/H 104-53 103-64 
S3 121-10 115-95 GWH/GB 74:13 72-09 
8’ 102-34 99-60 NB/NH 43-91 48-22 
U 522-79 502-59 0,/0,L 84-03 84:30 
GH 76-40 70-01 O,/0,R 84-35 84-59 
VB 103-15 96-63 G/Gy 73:20 73-71 

J 137-43 128-93 ‘ml = 
NH 57-33 | 52:53 He 87-43 S718 
NB 25-30 | 25-27 NL 64-84 65-67 
OR 44-61 42-67 AL 71-75 73-64 
OL 44-42 42-36 BL 43-39 40-67 
OR 37-59 | 36-01 65 12-88 10-84 
OL 37-29 35-52 6, 30-33 29-83 
Gy 50-10 47-38 PL 84°-69 84°-50 


* Values obtained by ‘‘ loose packing.” 
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4, Mean Values of Characters. To judge of the degree of accuracy possible 
in reaching a Moriori type, we must note that the means were based for males 
on 33 to 35 crania and for females on 20 to 23 crania, according to the character 
measured; the one exception being capacity, for which only 28 male and 19 female 
crania were available. The non-adult crania have been omitted from the mean 
value determinations. The values are given in Table I. The question now arises: 
how far can we consider these values as adequately describing the Moriori skull? 
We may first compare our results with those obtained by Dr Duckworth for the 
Moriori crania at Cambridge*. Owing to the smallness of his series (7 male and 
3 female) he has pooled his sexes to obtain his means. We have pooled ours in 
the same proportions; the following values result: 


Thomson Duckworth 
Maximum Length _... nee 184-4 182-5 
Maximum Breadth ... so 140-5 142-5 
Ophryo-Iniac Length soe 170-54 173-0 
Basi-bregmatic Height es 135-0 132-5 
Nasal Height ... ses aes 55-9 54:2 
Nasal Width ... si aise 25:3 24-2 
Basi-nasal Length ... es 104-6 101-2 
Basi-alveolar Length ... ss 99-9 97-2 
Horizontal Circumference _... 516-7 513-6 


With the exception of the breadth and our doubtful ophryo-iniac length, 
Dr Duckworth’s crania appear slightly smaller than those at the Royal College 
of Surgeons. A somewhat closer result is obtained if we pool our males and females 
without regard to Duckworth’s proportions. His numbers are, however, too few 
for us to do more than say that his results agree not unreasonably with ours f. 


* loc. cit. p. 150. No. 4 is given ?9. No. 2 is given as sex doubtful on p, 142, but as definite 3 
in table on p. 150. 

{ Obtained only from contours. 

{ The German anthropological catalogue (Die Anthropologische Sammlungen Deutschlands, Bd. 1. 
1877—1902, Berlin v. 8. 51—2, Bresike Catalogue, 1880) provides measurements of three Moriori crania 
from the Chatham Islands in the Anatomical Museum of Berlin University. Two of the skulls are of 
children and the third of an aged person. They are thus of little use for comparative purposes. 
The measurements given of one of the young skulls which retains one milk tooth and is metopic are: 

C—WN300 i —70,00b — loi, Hi— 26), S— 121) 8S, — 124) 3S, — 108, Ga — 97: 

The second child skull is metopic, has no 3rd molars and the palate is not yet closed; further 
it is deformed, a large area on right side of temporal and left side of parietal being much depressed. 
The measurements given are: 

C= 1140, L= 164, B= 133, H = 126, 8,= 116, S,= 104, S,= 125, GH = 93. 

The aged skull has depression of the parietals, alveolar rim much absorbed after loss of incisors 

and the three sutures completely synostosed. The measurements given are: 

C = 1450, L = 183, B= 136, H = 136, S, = 130, S, = 125, S; = 120. 
The greatest length, assuming it a male, is somewhat below the average, H is about normal, S; and 
B somewhat below average, but the other characters including the capacity above the average, but 
not greatly. It appears from the description—namely roof shaped, with slightly retreating forehead, 
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More valuable comparative material is that of Scott*. I have placed in 
parallel columns in Table II those measurements of Dr Scott which in any degree 
correspond with mine. We agree so closely in the main straightforward measure- 
ments, that where a really substantial difference is apparent, I think it must be 
attributable to differences in method of measuring rather than to differences of 
material. I ought to say that I am responsible for the determination of the means 
from Dr Scott’s tables of measurements. While the sex ratio in my: material is 
roughly as 3:2 that of Dr Scott’s is 3:1, so that he has only about half my number 
of females, although his males are about the same in number as mine. I think it 
probable that we have followed a different appreciation of sex, because, while his 
male measurements are on the whole very close to mine, practically all his absolute 
female measurements are less; thus, the difference he gets between the sexes is far 
greater than my difference, and rather greater than is usual among uncivilised 
races, where there is a greater tendency to equality in the measurements than 
among civilised races. I would venture with all deference to suggest that possibly 
size has weighed too much with him in sexingy. The first difference between our 
results to be noted is 33 cm.3 in the male capacity, but he used shot and I used 
mustard seed and a difference of this order may well arise with such difference of 
method. It is more or less confirmed by the result for female crania, because 
although his skulls are distinctly less than mine in size, he gets only a difference 
of 19cm. in the capacity. Our Ff, L, B, B’, H and (?) LBt values are in 
good agreement for males; so also are our U, G’H, J, NH, NB, O,L, G,, GL, 
fml, fmb, and all the indices except the orbital index for the same sex. Our 
difference in the orbital index is due to measurement of the breadth O,L, but 
this is precisely a measurement, which, without absolutely identical method, often 
leads to great divergence. Turning to our differences we first note Q, the 
transverse arc; the considerable difference here is repeated in the females. 


prognathous, and with the teeth very much ground down—to have the usual Moriori characteristics. 
Four crania, one a child’s, two male and one female, were collected during the Challenger Expedition. 
They are described and measured in the Challenger Reports, Zoology, Vol. x. p. 73, Report on the 
Human Crania. The two males are clearly fine crania, exceeding our average, the female is below 
average. Three crania are described by Zuckerkand! in the Reise der Novara; Anthropologischer Theil, 
1 Abth. S. 104; but the few measurements given are of small use for comparative purposes. Finally 
five at Paris are considered by de Quatrefages and Hamy in the Crania Hthnica, p. 461, Paris, 1882. 
The three male crania appear to be very large as compared with our series, giving a mean 
C= 1600 em.3 (!), U= 548, LD= 194, B= 146, H = 137 and J = 142, while the two females have a mean 
C= 1565 cm.3 (!), U = 524, D = 185, B= 144, H = 141, J = 132. 

Either the measurements are not according to our standards or the crania have been selected as 


specially remarkable skulls. 

* loc. cit. on our p. 82. 

+ We have included Scott’s three doubtful crania with his females, not only because he himself 
in his table puts them in that section, but because, being the rather larger crania of the section, they 
somewhat reduce the differences between his and our female mean characters. 

{ I think it possible that Dr Scott and I have not determined the alveolar point in precisely the 
same manner, and IJ have accordingly bracketed this measurement and the resulting Gnathie Index 


in Table II. 
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TABLE II*. 
Mean Characters of Moriorr Crania, Thomson and Scott. 
Males Females 
Thomson Scott Combined Thomson Scott Combined 
Capacity 1422-07 (28) | 1454-97 (29) | 1438-81 (57) | 1310-73 (19) | 1291-22 (9) 1304-46 (28) 
1 eee 182-00 (35) 181-08 (32) | 181-56 (67) 176-09 (22) | 172-60 (10) 175-00 (32) 
DR os 186:87 (35) 186-84 (31) 186-86 (66) 178-60 (22) 175-60 (10) 177-66 (32) 
1 eee 141-40 (34) 142-10 (31) | 141-74 (65) 138-41 (22) 134-20 (10) 137-09 (32) 
Bie ses 95-31 (35) 94:09 (32) | 94:73 (67) 94:02 (22) 91-20 (10) 93-14 (32) 
jg Leet 135:91 (34) 135-59 = (32) 135-75 (66) 132-81 (21) 128-10 (10) 131-29 (31) 
Bis. 105-60 (34) 103-75 (32) | [104-70] (66) 102-31 (21) 98-70 (10) | [101-15] (31) 
(Oar 316-11 (34) 307-12 (32) | [310-81] (66) 310-00 (21) 289-00 (10) | [303-23] (31) 
Siuee 126-03 (34) 130-25 (32) | [128-09] (66) 121-39 (23) 120-40 (10) 121-09 (33) 
iSfp-eac 122-40 (34) 122-56 (32) 122-48 (66) 117-36 (23) 117-00 (10) 117-25 (33) 
Seance 121-10 (34) 122-84 (32) 121-94 (66) 115-95 (22) 114:00 (10) 115-34 (32) 
IS Sue 368-77 (34) 375-75 (82) | [872-15] (66) 354-40 (22) 351-40 (10) 353-46 (32) 
(Of ape 522-79 (34) 521-97 (31) 522-40 (65) 502-59 (22) 493-80 (10) 499-74 (32) 
GH 76:40 (35) 75:86 (28) 76:16 (63) 70-01 (21) 67:25 (8) 69-25 (29) 
4 eee 137-43 (34) 138-89 (28) 138-09 (62) 128-93 (20) 127-10 (10) 128-32 (30) 
NH 57°33 (35) 57:50 (32) 57-41 (67) 52-53 (21) | 50-90 (10) 52-00 (31) 
NB 25:30 (35) 26-47 (32) 25:86 (67) 25:27 (21) 24-80 (10) 25-12 (31) 
OL 44-42 (35) 41:53 (32) [43-04] (67) 42-36 (21) 39-10 (10) [41-31] (31) 
OL 37-29 (34) 36:75 (32) | 37:03 (66) 35:52 (21) |} 35-30 (10) 35:45 (31) 
Gare 53-12 (33) 54:10 (380) 53:59 (63) 50-46 (21) 49:00 (10) 49-99 (31) 
GL 100-93 (34) 101-42 (31) 101-16 (65) 97:36 (21) 96-50 (10) 97-08 (31) 
fal... 35-29 (33) 35:53 (82) | 35-41 (65) 34-41 (20) | 34-00 (10) 34-27 (30) 
fimb 30°78 (34) 30:94 (32) 30-86 (66) 30-15 (22) 29-70 (10) 30-01 (32) 
B/L 76:07 (34) 76-29 (30) 76-17 (64) 77-52 (22) 76:44 (10) 77-18 (32) 
A/L 72-78 (34) 72:60 (30) 72:70 (64) 74:52 (21) | 72:96 (10) 74:02 (31) 
NB/NH 43-92 (34) 46-14 (32) 44:99 (66) 48:22 (21) 48-75 (10) 48:39 (31) 
0,/0, tae 84:03 (33) 88-64 (32) [87-65] (65) 84:30 (21) | 90-32 (10) [86-24] (31) 
fml/fmb —... 87:43 (32) 87:20 (32) 87:32 (64) 87:78 (20) 87:44 (10) 87:67 (30) 
Gnathic Index 95-58F (34) | 97-81 (31) [96-64] (65) 95-16 F (21) 97-18 (10) [95-81] (31) 


* Figures in round brackets give the number of skulls measured. Means in square brackets are based 
on possibly non-comparable methods of measurement. 


+ Found from mean lengths only. 


I think it is due to difference in method. Assuming Scott to have measured 
from auricular margin to auricular margin, he has probably measured the least 


(or geodesic) arc, and not through the “apex’”’*. 


Our difference in the frontal 


part of the sagittal arc, S,, is very possibly due to his more careful pressing of 
the tape round the glabella, especially as it is less sensible in the female results. 
The upper face height 1s a measurement subject to some variation and doubt 
if the alveolar ridge be at all worn; our value of G’H seems reasonable, but 


Scott’s to be in excess of any comparative material. 


* He measured the entire transverse circumference and divided it into ‘“supra-’ 


auricular parts, not further defined. 


> 


His values for the female 


and ‘‘infra-” 
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differ in upper face height in exactly the opposite direction. Thus in the 27 
measurements compared in the case of the male crania, our differences in four, 
the capacity, the transverse arc, the frontal section of sagittal arc and the orbital 
breadth (involving a fifth, the orbital index) are, I believe, due to differences in 
the methods of measurement. In the case of the female measurements the com- 
parison is not so satisfactory; not only are my numbers small, but Scott’s are still 
smaller. This fact, however, would not account for his absolute lengths being 
almost invariably smaller than mine. As I have said I think he may have laid too 
great stress on mere smallness of the skull in sexing. Notwithstanding the combined 
measurements, transverse arc, orbital breadth and orbital index excepted, give 
even in the case of the female crania by no means unsatisfactory results. Unless 
—which seems now very improbable—further long series of Moriori crania are 
found, Table II, combining Scott’s results with my new measurements, must stand 
as the best available svstem of Moriori cranial data. It involves measurements 
of nearly 100 skulls of a race now practically extinct, and should be helpful in 
determining the racial affinities of the Moriori. 

Before leaving Scott’s important paper, I should like to note some of the 
peculiarities of the Moriori crania emphasised by him. He states that typical 
Moriori crania are usually large and heavy; our table of comparative measure- 
ments (Table III) amply verifies this statement as to size. As to the weights 
we find: 


Mean Weight of Crania. 


Moriori Fuegian Egyptian 

(Thomson) (Sergi and Martin) (Davin) 
Males be do 764-3 grs. 872:3 grs. 623-6 grs. 
Females... 500 622-2 grs. 722-5 grs. 482-4 grs. 


It is possible that the Fuegian crania were weighed, unlike ours, with mandible 
attached. If so this would account for some, if not all the difference observed. 
Clearly the Fuegians have very heavy crania, and the Moriori heavy crania, 
but the latter are not absolutely unique in this respect. The average weight of 
an Egyptian mandible is 73-1 grs., but of course in the case of these crania the 
material is in a very different state to the Moriori crania. Still I do not think 
that the observed difference, 108 in males and 100 in females, can be wholly due to 
mandibles being included in Sergi and Martin’s weighings of Fuegian and not 
included in our weighings of Moriori. I am inclined to think that while the Moriori 
crania are heavy, they are not as heavy as the Fuegian. The contours also bring 
out the same point of great size. 


Scott further draws attention to the prominent parietal eminences, the roof-like 
vertices and the pentagonal character of the norma occipitalis. Our Plates XIV, 
XV, IV, and VI will amply demonstrate to the reader these Moriori features. 
Further he remarks on the low flattened retreating frontal region as a most 
striking feature (cf. our Plates III, VII and XV). This character is well measured 
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by the frontal index = 100 x subtense of nasio-bregmatic arc/nasio-bregmatic chord. 
The value of this index for the Moriori crania is ¢ 19-39, 2 21-14. Comparative 
numbers are: 


Cranium of the Murderer Thurtell § 21-65 
Whitechapel English of 17th century S$ 22-21 
Guanche Crania a 22:50 
Egyptians of XXVI-XXX Dy aie $ 22-77 
Eskimo Crania ee Deri! 
Negroes (Congo) 3 22-80 
Cro-Magnon Skull . 3 24-27 


It will be seen that the male Moriori stands lowest on the list of racial values 
available. 

Of further points noted by Scott the statement* that “the excess of width 
over height is generally well-marked; indeed in the most typical members of 
the group the brain case is distinctly flattened,” needs, perhaps, further con- 
sideration. The feature in question may be measured by the use of Pearson’s 
index 100(B— #H)/L. We have then the following racial table: 


Values of 100 (B— H)/L. 


Wiirtemberg Women «=. =©6r + 96 Aino Women ... ate aes + 0-9 
Wirtemberg Men + 9:5 Aino Men wae sits - +09 
Bavarian Women = + 92 Gaboon Men (1880) ... ae + 0-9 
Bavarian Men : + 9-0 Maori Men ... Sor poe a Pa 
English iajomen (Mooriclas)... + 7:9 Maori Women ae son ae 405) 
French Men ... Spe 4 Coptic Women oe we = 6 + 05 
English Men (Moorfields) toe chan Gaboon Women (1864) Soe (yas (085) 
English Women (Whitechapel) + 5-6 Theban Mummies, Men... ~— +: 0-4 
English Men (Whitechapel)... + 4:2 Gaboon Men (1864) ... = = +00 
Moriori Men ... on Se OLD Kaffir Men... saa .. + 0:0 
Moriori Women oe tee ELS 2 Australian Men ons fe AO2 
Congo Men ... aa .- +26 Coptic Men... we - = 0:2 
Australian Women ... we = OF 24 Prehistoric Egyptian (N waddle) 

Long Barrow Women Seem te teat Men .. — 0:3 
Long Barrow Men ... eis + 22 Gaboon Women (1880) ite — 0-4 
Fuegian Men SH, eee ean) Zulu Men ae: Sor Sore ee EE 
Congo Woment Se tee, lO Angoni Men ... ore ae ele? 
Theban Mummies, Women... + 18 Negro Men (North Africa) ... — 13 
Prehistoric Egyptian (Naqad a) 

Women ... es eo at LO 


Judged by this index the Moriori stand for excess of breadth over height per 
“unit of length at the top of all the early and primitive races and nearest to modern 
Europeans, and this is a feature which differentiates them from Maori and 


* loc. cit. p. 25. 
{ Erroneously placed as — 1-9 in Biometrika, Vol. vit. p. 307. 
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Australians and markedly from the full-blooded negro races. Scott continues: 
“the orbits are as a rule high, and the appearance of height is increased by the 
form of the superciliary ridges; while the nasal opening is narrow*.” Table III 
(p. 98) will show that the orbits (O,) are higher than any of the comparative material 
provided, although the Fuegians come fairly close and the Maori follow them, 
but they are distinctly higher than modern European races and the Negroes. On 
the other hand the orbits are distinctly broader than those of any other race, and 
the result is that relatively to the breadth they about equal the Fuegians in orbit, 
fall behind the Maori and about equal the Negroes and Bavarians. It would probably 
be better to state that Moriori orbits are large rather than high. Again the state- 
ment that the Moriori nasal opening is narrow requires qualification. Its absolute 
breadth is about the same as Maori, Fuegian and Aino, and in excess of modern 
Kuropeans. It will be found that it is the nasal height of the Moriori which is 
the noteworthy absolute feature of their noses, and the result is that their nasal 
index falls below that of any of the races placed in Table III for comparison. This 
is markedly true of the males. 

Of general descriptive properties Dr Scott says there were no metopic crania 
in his series, that the coronal and sagittal sutures were simple, but the lambdoid 
often very complicated; these remarks would apply well to the Royal College of 
Surgeons’ collection, although two of the skulls of Moriori children at Berlin are, 
as might well be, metopic. The Moriori cranium does, indeed, strike one as 
“primitive,” but in a different sense to the Australian cranium. A comparison 
of the following table will show how very little resemblance there is between the 
Australian and Moriori crania: 


Australians (Robertsont+) Moriori (Thomson) 
Character ee : AaTe 
3 (78) Q (22) 3 (34) 9 (22) 
Maximum Length (LZ) ... 183-56 175-59 186-87 178-60 
Maximum Breadth (B)... 130-60 128-73 141-40 138-41 
Basio-apical Height (H) 131-13 124-50 135-91 132-81 
100 B/L ... Si ne 71-29 73°36 76-07 77-52 
100 A/L ... ane Son 71-50 70-95 72-78 74:52 
100 A/B ... ace eae 100-50 96-64 96-12 95-95 
——~-! 


It is clear that the Moriori is a much larger skull than the aboriginal Australian, | 
and of wholly different proportions. And yet those who have handled the Moriori 
skull cannot doubt that in its special manner it is as primitive as the Australian. 
The very bulging glabella, the low receding forehead, the considerable flattening 


* loc. cit. p. 25. It should be noted that Scott means these remarks to apply to his first type, but 
he admits the continuity of his first with his second type (p. 26). = 

+ Proceedings R. S. Edinburgh, 1910—1911, Vol. xxx1. pp. 1—16. Craniological Observations on 
the Length, Breadth and Height of 100 Australian Aboriginal Crania. 
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on either side of the sagittal suture giving rise to the sagittal crest, combined with 
great weight strike one at once. Anatomical peculiarities as will be noticed from 
my “Remarks” are frequent. Thus flattening of the obelion, parietal bulging, 
ossicles of the lambda are common; epipteric ossicles were noted in 17 out of the 
63 crania. There were four cases of asymmetric foramina of which the most 
pronounced is photographed in Plate XIII. Marked inions or inionic ridges occur 
in several cases, while of rarer anomalies I noted an incisura crotaphitico-buccinatorus 
in one female, an ossicle of the coronal suture (Plate XV), and para-occipital 
processes articulating with the axis in other cases. Of non-congenital deformities, 
perhaps the most remarkable is a healed fracture of the mandible on the left side 
(Plates XX—-XXIT). Extreme attrition of teeth* was also noticed in a large 
proportion of the crania in this collection, a feature on which Dr Duckworth has 
already laid great stress in his paper. The general asymmetry of the skulls will 
be well realised, not only from a perusal of the remarks on the individual crania, 
but also by examining the measurements right and left given on the horizontal 
and transverse type contours. To sum up in the language of Flower and Turner, 
the Moriori are a megacephalic, mesatricephalic, metriocephalic, mesoseme (although 
close to the borders of microseme), leptorhine in male, mesorbhine in female, ortho- 
gnathous, brachyuranic} race. They are thus mediocre in most cranial characters, 
but their non-mediocre features, those of megacephaly, orthognathy and brachy- 
urany, associate them rather with the more highly civilised than with the low 
races. As we have already said, the “ primitive” character of the Moriori cranium 
does not lie in the same characters as differentiate the Australian aborigines or 
even the Negro from the European. It is a type of primitiveness hardly described 
by such characters as we have studied above. It is, perhaps, better considered 
in reference to the contours. 


* A similar attrition of the teeth was noticed in the prehistoric Egyptians and has been attributed 
by some to the use of stone mortars for grinding the grain. 

f Scott (loc. cit. p. 4) defines palato-maxillary index as 100 x palato-maxillary length/palato- 
maxillary breadth. But his lengths are always less than his breadths, just as are Turner’s corresponding 
values for the four Moriori skulls acquired on the Challenger Expedition. Scott has accordingly, since 
his indices are over 100, really used, like Turner, the ratio of breadth to length. Turner following Flower 
measures the length as the distance from the alveolar point to the line drawn between the most posterior 
parts of the maxillary tuberosities. This is not exactly our palate length, but appears close to it in 
value. On the other hand Scott and Turner include in their palate breadth the alveolar arch and thus 
get a breadth very largely in excess of ours, and, we consider, no true palate breadth at all. To allow 
for the breadth of the alveolar arch, we have supposed it proportional to the palate breadth as measured 
in our manner. But we can only compare our breadth with Scott’s breadth for our two series as we 
have not the two measurements in either case. There results for the mean breadth of the (double) 
alveolar arch at the second molars: 

Moriori males, 26-47, Moriori females, 22-79. 
We thus deduce for our Moriori the palato-maxillary index of Turner as 123 for males and 119 for 
females corresponding to the brachyuranic group. If we reverse the process, altering the above 
values in the ratio of Scott’s palatal breadth for Maori to his palatal breadth for Moriori, i.e. subtract 
26-47 x 62-77/65-32 from 62-77 and 22-79 x 59-40/59-89 from 59-40 for males and females respectively, 
we obtain the values for the palatal breadths on our plan, i.e. 37-33 and 36-80 recorded in brackets under 
Maori in Table III, and the corresponding values of the palatal index. 
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TABLE III. Comparison of Moriori and other Races in Cranial Characters. 


Moriori Maori Aino Fuegian Oh aas) Neeen 
Character s 
3 (c. 34) | 2 (c. 22) | o (c. 43) | 2 (c. 20) | d (c. 88) | 9.(c. 63) | d (c. 34) |  (c. 24) | g (c. 98) IP (c. 128) | g (c. 39) 
C 1498 1378 1476 1288 1462 1308 [1474] | [1290] 1381 1288 1348 | 
L aoc -. | 186-9 178-6 185-5 175-6 185-8 177-2 192-0 185-0 185-1 177-5 182-9 |~ 
B as .. | 141-4 138-4 140-1 134-8 141-2 136-8 145-0 | 141-0 134-9 131-5 133-2 
B’ Mee 308 95-3 94-0 95-7 91-2 96-2 92-4 97-1 91-0 91-1 88-2 95-9 
H a reek eooe)) 132-8 137-6 133-0 139-5 135-1 141-0 137-0 135-2 129-5 135-3 
OH ise noge | allel 115-1 -= — 119-3 115-0 — —- 115-5 113-1 115-0 
LB ae -- | 105-6 102-3 103-9 100-5 105-4 100-3 | {104-5} | {96-3} 99-3 94-9 103-0 
TEML ee an || abr) 84°°5 — — 82°-0 81°-0 {82°-0 | Sergi} 84°-4 84°-5 82°-0 
Q ae cen soLO 310-0 307-5 293-8 328-5 317-1 | {316-5} | {298-6} | 304-2 | 296-5 306-8 
S aie «| 368-5 354-4 378-1 361-8 372-8 360-7 | {390-0} | {370-0} | 373-0 363-6 367-7 
U sek es | 522-8 502-6 518-0 496-8 522-5 501-7 531-0 502-0 511-0 493-7 508-5 
Ga: pet 70-6 69-6 71-4 64:4 69-8 65-5 {71-3} {69-7} 67-6 65-8 68-0 
GB a + | 103-2 96-6 — — 102-1] 96-7 | [110-02]| [103-2?]) 95-9 92-5 96-6 
J ae wae | 2187-4 128-9 136-8 127-9 137:3 129-7 143-6 132-1 125-6 117-0 129-5 
INE eee ot 57:3 52°5 53-8 49-4 50-5 47-4 {54-0} {49-6} 48-9 46-7 49-5 
NB Sets Ae 25:3 25:3 25-6 23-4 25-6 24-7 {26-3} {23-8} 25-1 24-3 27:3 
OL ae sie 44-4 42-4 40-5 39-2 40-9 39-8 {42-5} {40-3 38-1 37:2 39-6 
OL cee See 37:3 35°5 35-0 33°6 34-9 33-9 {36-0} | {34-7 32:3 32-0 34:2 
Gy se oe 53-1 50:5 52-4 48-5 53-0 51-4 | [{56-0}] | [{54-0}] | 55-8 53-9 49-4 
Gs, ae wae 38-9 37-1 [37-3] | [86-8] 38-2 37-4 | [{43-0}] | [{39-0!]] 40-3 38-9 39:7 
GL sie »s | 100-9 97-4 100-8 96-9 104-9 100-1 | {100-0} {96-0} 94-7 91-0 105-3 
In ao Pca. lle tays 34-4 35-2 33-1 — | o— {36:0} | {37-3} ae rE a 
Unb “ee 308 30-2 30:8 29-0 — — (31-0; | {31-7} im =< = 
100 B/L ee 76-1 77-5 75:4 74-7 76:5 77-6 76-6 | 77-7 73-0 74:2 7 
100 A/L zie 72:8 74:5 74:7 74-2 75:6 76-7 73-4 72:9 73°3 73-2 74- 
100 B/H . | 1045 103-6 | [101-8] | [101-4] | 101-24 | 101-24 | [102-8] | [102-9] 99-8 101-6 98- 
100 NB/NH ... 43-9 48-2 47-9 49-1 50:7 50-5 {48-5} {47-9} 51-1 52°3 55-4 
100 0,/0, ae 84-0 84:3 86-1 86-2 85:3 85-2 {84-6} | {85-0} 84-8 86-0 ST 2a 
100 G,/G, es 74:1 72-1 [71-2] | [75-9] 72-1 72:8 [{76-8'] | [{72-2]} 71-9 72:3 81:5 
100 fmb/fml... 87-4 87:8 87-4 88-6 — — 89-0 84:2 — = — 
Gnathic Index [95-6] | [95-2] 97-0 96-4 [99-5] | [99-8] 96-2 | 99-3 [95-4] | [95-9] | [102-3] 
Congo Gaboon English Long Bavarian 
Bantus Bantus (Whitechapel) Barrow (Alt-bayerische) 
Character 
3 (c. 50) | 9 (c. 27) | g (c.50) | P(e. 44) |  (c. 72) | 2 (c. 80) | g (c. 12) | 2 (e. 8) |B (c. 100) | 9 (ec. 100) 
C 1344 1206 1381 1232 1477 1300 — — 1504 1336 
L eo She) alerts! 171-4 179-5 171-6 189-1 180-4 190-6 182-6 180-6 173-5 
B AS; w. | 138-5 130-9 135-5 130-6 140-7 134-7 142-4 138-6 150-5 144-0 
B ees a 97-5 92-6 96-5 92-8 98-0 93-1 98-9 94-1 103-7 96-3 
H 133-8 127-5 135-4 130-1 132-0 124-6 137-8 135-1 133-8 128-0 
OH] sae} 3:9 109-0 — — 114-6 109-2 120-7 118-0 | [111-5?]) [109-02] 
IB fe <8 98-2 93-3 100-3 95-8 101-6 95:3 102-0 96-8 100-3 96-3 
Pie 82°-8 81°-1 — — 86°-1 87°-1 83°-0 2 89°-1 88°-8 
Q 310-7 298-2 310-5 296-3 307-9 294-0 321-8 312-0 329-7 318-7 
S 361-8 349-8 364:3 350-4 377-1 362-8 384-8 382-0 365-1 353-4 
U Aer .. | 504-3 489-3 501-2 481-2 524-3 503-8 534-9 518-7 524-4 501-4 
GT, a8 siete 63-4 59-3 66-4 62-7 70-2 65-9 69-9 66-8 70-8 66-8 
GB See ae 94-8 92-2 95-6 90-9 90-9 84-9 95-9 92-8 95-1 89-7 
J Hee vee el2625 122-1 129-0 120-5 130-1 120-3 134-0 132-5 135-0 126-3 
INEEL sae 47-2 45-0 48-2 45-1 51-2 48-7 49-4 47-0 50-9 48-2 
NB © os ae 26-0 25-0 26-6 24-7 24:3 23-2 24-1 22:8 24-8 23-7 
OUP 5. ae 40-4 40-5 41-5 39-6 43:1 41-2 38-1 36-9 39-9 38:3 
OL... an 34-6 33-4 35-0 33:7 33°5 33-6 31:8 30-6 33-7 33-2 
Gy a aes 50-3 50-2 52-1 50:1 48-3 45-1 55:3 57-0 [44:3?]| [42-22] 
G, oe ae 38:9 35-9 39-5 37-4 36:8 35-2 43-7 {36-0} | [33-22?]] [32-12] 
GL aon me 96-5 93-9 100-5 98:5 | 95-9 90-4 95:3 92-6 — — 
fml sae ae 35-9 35:3 35:9 34-0 35-4* 34:3 * 35-7 34:5 — — 
{mb ... Dy 30-0 ave 30-0 28-1 29-7 * 29-0* 27-7 30:3 _ _— 
100 B/L ae 78-0 76-5 75:5 76-2 74:3 74-7 74:9 76:3 83-2 83-1 
100 H/L ais 75:4 74:6 75:5 75:7 70-0 69-1 72:7 74-0 74-2 73-9 
100 B/H ... | 103-6 102-6 100-1 100-7 106-3 108-5 | [103-3] | [102-6] | 112-5 112-5 
100 NB/NH ... 55-2 55:8 55:3 54:9 47-6 47-8 49-0 49-1 48-7 49-2 
100 0,/0, sie 85:7 82-5 84:6 85-2 77-9 81-7 83-4 81-7 84-5 86-6 
100 G./G, Mie 76:8 70-8 76-1 74:8 76-3 77-7 [79-0] | [{63-2}]|} [74:4] | [76-0] 
100 fmb/fml... 83-5 81:8 83-9 81-2 84-2* 84-5* | [77-7] | [87-7] — — 
| Gnathic Index [98-2] | [100-6] | [100-4] | [101-8] [94:4] | [94-8] | [99-4] [99-9] —_ —_— | 
| | 


* Values for English (Moorfields). 
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5. Comparison of Moriori cranial Characters with those of other Races. In TableIII 
I have gathered together such material as seemed of interest for comparative 
purposes. Unfortunately I have not been able to obtain a good series of Australian 
measurements for comparison, but the values of the few characters given by 
Robertson for fairly long series, show that there is little relationship between the 
two races. In the first place we put Scott’s Maori values. We follow these by 
data for the Aino, the means being calculated from the measurements in Koganei’s* 
Tables; then such slender results as we have been able to obtain for the Fuegiansf. 
Lastly we place three Negro races, North Africans{ and Benington’s Congo and 
Gaboon measurements§, to complete the series of non-European primitive races. 
For European races we give English|| and Bavarians§] as modern representatives, 
and the Long Barrow ** as an ancient race, unfortunately with very slender numbers. 
Finally the Egyptians find representation in Fawcett’s Naqada measurements ff. 


In Table III we have been obliged to make various assumptions to deduce 
certain of the values provided. They are either not provided by the authorities 
cited, or have been differently measured. The difficulties are greatest when we 
come to deal with orbital and palate measurements, where most divergence of 
method has existed. We have in certain cases had to aliow for length of spine 
or breadth of alveolar arch as obtained from other cranial series. Again some 
authorities do not record the index {{ 100 B/H or the gnathic index, and these have 
had to be supplied from the ratio of mean values. AIl such measurements are 
enclosed in square brackets, and when especially doubtful are marked by queries. 
In other cases where the numbers dealt with are quite inadequate the values are 
enclosed in curled brackets. We cannot in the present paper deal at length with 
the basis of these corrections and modifications but a few words as to our treatment 
of Fuegian data may make clear certain points. The fundamental memoir here 
is that of Rudolf Martin. He had only five Fuegian crania, but before his date, 


* “Beitrige zur physischen Anthropologie der Aino.” Mittheilungen aus der medicinischen Facultat 
der k. j. Universitat. Tokio, 1894, Bd. 11. Reduced in 1900 in the Biometric Laboratory. 

+ Rudolph Martin, “Zur physischen Anthropologie der Feuerlainder,” Zeitschrift fiir Anthropologie, 
Bd. xxi. 8. 155—218, 1893. J. V. Hultkrantz, “Zur Osteologie der Ona und Yahgan Indianer des 
Feuerlandes,”’ Wissenschaftliche Ergebnisse der schwedischen Expedition nach den Magellanlindern 
1895—97 unter Leitung von O. Nordenskjoeld. Stockholm, 1900. 

{ Reduced in the Biometric Laboratory in 1900 from data in the German Anthropological Catalogues. 

§ R. Crewdson Benington, ‘“‘A Study of the Negro skull with special reference to the Congo and 
Gaboon Crania.” Biometrika, Vol. vitt. pp. 292—337, 1912. 

|| Results for Whitechapel 17th century English used; see W. R. Macdonell, ‘“‘A Study of the 
Variation and Correlation of the Human Skull, with special reference to English Crania.”” Biometrika, 
Vol. m1. pp. 191—244, 1904. 

4] Reduced in Biometric Laboratory in 1900 from the data in J. Ranke’s paper, “ Beitrage zur 
physischen Anthropologie der Baiern,”’ Bd. 1. 

** Edgar H. J. Schuster, “Long Barrow and Round Barrow Skulls in the Oxford Museum.” 
Biometrika, Vol. tv. pp. 351—362, 1905. 

tt C. D. Fawcett, “A Second Study of the Variation and Correlation of the Human Skull with 
special reference to the Naqada Crania.” Biometrika, Vol. 1. pp. 408—465, 1902. 

tt Unfortunately in the tables (Biometrika, Vol. 1. pp. 426—7) Fawcett has for the two races Bavarian 
and Aino ($ and 9) the values of 100 H/B put under 100 B/H. 
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1893, Mantegazza had dealt with 16, Sergi with 14, Garson with 8, Hyades with 
8, Sir William Turner with 4 and Quatrefages with 2 skulls. Since his date 
Hultkrantz has had six additional specimens to measure. . Thus there are really 
between 60 and 70 Fuegian crania certainly available and no doubt if further 
search were made we might run the value up to nearly 100, and get substantial 
and important results. Unfortunately some of the writers referred to made very 
inadequate series of measurements, even on the few crania they had at their 
command. Martin puts together a good deal of their measurements, but apparently 
thinks it sufficient to average their averages. This is somewhat dangerous con- 
sidering the large differences in the numbers they have dealt with. He does not 
also, we think, sufficiently emphasise the wide divergencies to be expected when 
means are based on 2 to 10 crania. We have endeavoured to weight the results 
of the various recorders and such are the numbers given without curled brackets 
in the Fuegian column. The values in curled brackets are based solely on 
Martin’s 2g and 39 crania. The statistically trained mind would say that they 
are worthless as a determination of type. With this statement we should not fail 
to concur, did not the means provided by Hultkrantz for his six skulls combined 
accord so closely with our results for male Fuegians as to force us to the conclusion 
that his crania were practically all males and that by a lucky chance we have 
reached even with such miserably slender material results closely approximating 
to a true Fuegian type*. 


Character Table III ¢ | Hultkrantz Character Table III g | Hultkrantz 
Capacity : = 1474 1443 Nasal Height ae 54-0 54-6 
Maximum Length aa 192-0 191-5 Nasal Breadth Ac 26-3 25:3 
Maximum Breadth os 145-0 146-1 Orbital Breadth ... 42-5 40-0 
Basio-bregmatic Height 141-0 140-6 Orbital Height 300 36-0 35:3 
Minimum Frontal Breadth 97-1 98-0 Orbital Index ae 84-6 87-4 
Profile Angle ... é 82°-0 83°-6 Cephalic Index Rae 76-6 76:3 
Horizontal Circumference 531-0 531-0 Height Index 73-4 72:1 
Sagittal Circumference . 390-0 381-0 Breadth-Height Index 102-8 103-9 
Transverse Circumference 316-5 316-1 Foraminal Length ... 36-0 35:3 
Bizygomatic Breadth ... 143-6 143-5 Foraminal Breadth ... 31-0 31-0 
Profile Length ... Ste 100-0 106-0 Foraminal Index... 89-0 87-9 p 


We see how remarkably alike the results are even for such short series, and there 
is sufficient indication that with uniform methods of measurement the existing 
Fuegian crania would suffice to provide a quite good typical series of values. 


Turning to examine Table III a little more closely, we see that we can dismiss 
any relation to the Negro race, either in Sudanese or Bantu form. Egyptians 
and central Europeans show no affinities. It is only with the English or the Long 
Barrow crania that there is any approach, and the latter are so slender in number 


* A more thorough collection and examination of all available Fuegian data will shortly be under- 
taken. 
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that it seems idle to deal in more detail with them. On the other hand there is 
undoubted affinity to the Maori; and rather surprising resemblances to the Aino 
and to the Fuegians are worth taking into consideration. The full discussion of 
the proper method of representing racial affinity by a single coefficient must remain 
undiscussed at present. A rough measure will only be used here; we take the 
mean percentage difference of each race, for males only, on the basis of the Moriori 
means, and then consider the mean of these percentages as a working coefficient. 


Table IV gives these percentage differences to a single decimal place, and we 
note at once the following points: 

(i) While the Maori are nearest of the races considered to the Moriori, both 
Aino and Fuegian are surprisingly close. 

(ii) Seventeenth century English are more distant than the Fuegians, but 
have very far from the deviation of a race like the Sudanese Negroes (i.e. 3-92 
as compared with 5-80). 

(iii) The current view that the Moriori are like the Maori an intermixture of 
Polynesian and Melanesian stock scarcely receives confirmation from Table IV; 
the Moriori stand nearer to the Londoner of the 17th century than to the Negro. 


(iv) The Maori themselves are still more closely related to the Fuegian (2-87 
as against 3-13) than the Moriori. 

Thus it is difficult to believe that there has not at some time been a race-link 
between the present denizens of extreme South America and the denizens of 
extreme Polynesia. 


It is very regrettable that our data for the Long Barrow inhabitants of Britain 
are so slender. Macdonell has already pointed out their relative closeness to 17th 
century Londoners and the work of Crewdson Benington* seems to show that the 
English commonalty of to-day is one with the 17th century Londoners. We have 
to look to the bulk of the English population to-day as being non-Teutonic in 
origin, and much talk of our Germanic relationship may ultimately prove to be 
idle. Craniologically the English commonalty belong in bulk to an earlier and 
probably more primitive race. It would be unprofitable to speculate until we 
have larger series and more complete measurements—especially of Polynesian, 
Melanesian and American Indian crania—on the meaning of the results suggested 
in the present comparison between the denizens of the extreme fringes of enormous 
areas. They give one at least a suggestion of vast folk-wanderings in a world 
possibly of different conformation. Practically they warn us at least to consider 
the problem of the relationship of the American Indian to either a north-Asiatic 
or a Polynesian stock as still unsettled}. Even the English relationship shows 
that the European origin of the American is still a workable hypothesis. We 
have confidence that if our craniological series consisted of hundreds instead of 
often units, statistical methods would enable us to construct a racial human 


* Biometrika, Vol. vu. p. 131. 
{ It is in our opinion impossible to accept the views of either Brinton or Hale as final. 


102 A Study of the Crania of the Moriort 


pedigree which would throw immense additional light on human descent. The 
first desideratum at the present time is the collection of crania from what we may 
call “fringe” races, and the second desideratum the construction of type crania, 
both by mean measurements and by contours, of the material thus collected. 
Only then will the science of craniology become a really useful guide to the 
anthropologist. At present arguments based on half-a-dozen crania—unaccom- 
panied by any previous study of cranial variability—are of the most slender weight 
and should simply be disregarded by the anthropologist *. 


TABLE IV. Comparison of Male Crania of Moriori with those of other Races. 


Percentage excess value of Moriori character over: 


Character Northern 
negro 


Maori Aino Fuegian English 


Capacity ... oe 
Maximum Length 
Maximum Breadth 
Minimum Frontal Breadth 
Basio-bregmatic Height... 
Auricular Height 

Cranial Base 

Profile angle ae 
Transverse Arc ... ae 4 
Sagittal Are B06 568 
Horizontal Circumferenc 4 
Upper Face Height 
Face Breadth : 
Bizygomatic Breadth 
Nasal Height 

Nasal Breadth 
Orbital Length ... 
Orbital Breadth ... 
Palate Length 

Palate Breadth 
Profile Length 
Foraminal Length 
Foraminal Breadth 
Cephalic Index 
Vertical Index ... 
Breadth-Height Index 
Nasal Index 

Orbital Index 

Palatal Index 
Foraminal Index... 
Gnathic Index 
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Mean Percentage Deviation 2-31 3:00 3:13 3:92 5:80 


* Tt is astounding to note what wide reaching results even the examination of a mere half-dozen 
Moriori, Fijian, Maori or Eskimo crania led the older craniologists to propound ! 
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6. Study of the Moriort Type Contours. The purposes served by these contours 
are (i) the comparison with the like contours of other races, and (ii) their use as a 
method for actually determining the mean values of certain characters. We shall 
consider them from both aspects. 


For the first purpose the sole available material consists in the contours for 
Bantu, Egyptian (XXVI—XXX Dyn.), English, Eskimo and Guanche crania 
drawn to scale and published with tracings in Crewdson Benington’s paper*. 
Some attempts to obtain type contours by other processes are either based on 
inadequate numbers, are deduced from far too few plotted points by an unsuitable 
method, or are reproduced to no standard, or at any rate to no easily comparable 
scale. For a type contour to be really valuable it should be drawn to full-size and 
published not only on ordinary but also on transparent paper. It is only in this 
form that it can be of real value for comparative purposes. Further the contour 
in the case of the sagittal section should contain the basal triangle and the occipital 
region. These were included in Dr Benington’s individual contours, but un- 
fortunately he omitted to add their mean values to his type contour. We have 
included them in the Moriori contours. 


Starting with the transverse section, we note that the length of the inter- 
auricular line is relatively large as compared with the Bantu or Egyptian crania; 
it is more nearly approached by those of the Guanche or 17th century English, 
which, however, still fall short of it. The Eskimo transverse section, with a some- 
what larger interauricular diameter, has near the “apex” a more marked sagittal 
crest; this crest so marked in the Moriori crania is, however, thrown as a rule further 
back in their case. If we consider a primitive form like the Cro-Magnon which 
has an interauricular length nearly as large as the Moriori type skull, its auricular 
height is so great that the Moriori transverse section falls entirely inside it. The 
following table suggests the nature of these results, and confirms the conclusion 


Measurements on the Transverse Type Section}, 3s. 


Race Interauricular Length Auricular Height Index 
Moriori ae a 125-2 109-5 87:5 
Eskimo 300 onic 128-0 114-2 89-2 
English 509 a 122-4 111-5 91-1 
Guanche ie Sat 119-6 109-8 91-8 
Egyptian... ae 117-4 110-2 93-9 
Congo, Bantut ae 116-8 113-6 97-3 
Cro-Magnon... a6 124-0 | 121-0 97-6 


* Biometrika, Vol. vu. p. 123. 

{ These measurements are from the type sections; it must be carefully borne in mind that they 
may differ from measurements of the same name on the crania themselves. 

t Fernand Vaz, 1864 Series. 
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previously reached, i.e. that the Moriori cranium approaches to a certain European 


type represented by modern English and by prehistoric Guanche* far more closely 
than it does to the Negro skull. 


As measured on the actual skull Scott obtains values of the interauricular 
length which give a male mean = 126-8, and we have obtained directly for the 
auricular height of our series of males 117-1; the reasons for these divergencies will 
be considered later. It would have been of the greatest value had we possessed 
type sections of the Aino and Fuegian crania for further comparison. 


Let us now consider the horizontal section. This as we have stated is taken 
through the glabella, while the cranium is arranged so that the plane of the drawing 
is parallel to the standard horizontal plane. Now the glabella is not a well-defined 
point, but rather a small area, and a little personal equation will make a con- 
siderable change in appearance of the horizontal section according as the plane 
of section recedes very little from the zygomatic and approaches slightly to the 
stephanic region. But as the horizontal sections discussed in the present paper 
are all drawn by members of the same school we are fairly certain that the con- 
cavity in the contour behind the temporal line is far more marked in the Moriori 
than in other crania we have drawn. But it has seemed to us—in view of the 
importance of the temporal lines from the evolutionary standpoint—desirable in 
future to take additional coordinates for the points, Tz and T+, where the section 
cuts the temporal line on the frontal bone. Otherwise these important points are 
lost in the general averaging out of the contours. Allowing, however, for some 
difference of treatment our contours as well as our plates (see especially Plates II, 
III, VII, IX, ete.) indicate how marked is the linea temporalis of the Moriori 
skull, and we think that this is an essential factor of our appreciation of its 
primitive character. If the reader will compare our Moriori horizontal contour 
with those of the Eskimo, Bantu and English crania, he will realise how relatively 
great is the depression of the facies temporalis in the Moriori frontal bone. 
A similar opinion will be formed by comparing the photographs of English 
crania on Plates II and V of Macdonell’s memoir{ with the corresponding photo- 
graphs of Moriori crania on Plates IX, XV, XVI and XXI, where the shadows 
indicate the massive character of the linea temporalis. This depression of the 
facies temporalis so noteworthy in the Moriori cranium is recognisable in the Cro- 
Magnon contour, but scarcely more than indicated in Egyptian, Congo and English 
crania. Now if the reader will superpose the English type horizontal section§ 
on the Moriori so that the point V falls on F and O on O, he will perceive that it 


* The resemblance of 17th century English to the Guanche type has already been insisted on: 
see Biometrika, Vol. vii. p. 135. 

+ The general asymmetry of the Moriori cranium is well illustrated by the fact that 7’, lies posteriorly 
to 7, in the type cranium: anteriorly the ordinates show that the left side, posteriorly the right side 
of the Moriori skull is larger. 

t Biometrika, Vol. 1m. p. 191 et seq. 

§ Tracing Fig. XXIX of Crewdson Benington’s paper, to which the other figures when attached 
to tracings refer. 
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is at ordinates 3 and 4 that the maximum difference takes place; it is the depression 
of the facies temporalis which is the marked differential feature of the two contours. 
If the Guanche contour (Fig. XXVI) be superposed in the same manner, the 
depression is substantial, but not so marked. When we turn to the Kgyptian and 
Congo contours (Figs. XX and XI), we must adopt a somewhat different super- 
position. We place O on O and OF in the direction OF. We find at once that 
the posterior portions of these contours fit surprisingly close to the Moriori contour, 
but that the massive frontal development of the Moriori is completely lacking. 
To bring out this special feature of the Moriori we may form a Temporal Index 
by measuring the percentage that Ordinate 3 is of the length of the section. We 
have the following results: 


Ordinate 3 


Race Temporal Index MOS Minimuin Rrontallreadth 


Moriori 50-9 99-1 
Cro-Magnon 55-7 = 
Egyptian ... 56-2 112-0 
Eskimo 57-2 — 
Congo, Bantu * 57-5 | 104-8 
Guanche 58-3 — 
English 58-8 111-6 


There are clearly somewhat mixed factors at work here, but the essentially 
differentiated character of the Moriori cranium is well-marked. It is the develop- 
ment of the fronto-temporal regions which especially distinguishes the crania of 
the English commonalty and the Guanche crania from the Moriori. 


We now turn to the sagittal contours, where, as we have already remarked, we 
have not unfortunately the basal triangle and occipital portion of the Benington 
contours for full comparison. The first problem is: Which standard line is to 
be superposed? We have actually the inion, the lambda, the bregma and the 
nasion as definite points, and the glabella and occipital as less definite localities. 
Schwalbe takes the glabella-inion line, Klaatsch the glabella-lambda line; we 
may also take the nasio-lambda line, or the nasio-gamma line, which is a line 
through the nasion parallel to the Frankfurt standard horizontal plane, meeting the 
occipital iny. Personally we doubt the value of any line drawn to either glabella or 
inion, as applied to the individual cranium; the terminals are too vague, although 
doubtless when we deal with a type contour, these so-called points average out 
to fairly reliable positions. The glabella-lambda line has been asserted to be 
nearly parallel to the standard horizontal planet, but we know of no properly 
drawn type contour in which the height of the lambda is not two to three times 
as great as the height of the glabella above the nasio-gamma line. In default of 


* Fernand Vaz, 1864 Series. 
Tt Quain’s Elements of Anatomy, Vol. tv. Part 1. p. 132. 
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other points on defective portions of crania either glabella or inion may be used. 
About the standard horizontal plane three statements may be made in the case 
of type crania: 


(i) It passes very nearly through, but a little below the inion. 


(ii) It is approximately parallel to the basio-opisthion line, i.e. parallel to the 
plane of the foramen magnum. 


(ii) It is approximately parallel to the maximum length, i.e. the glabella- 
occipital line. 


None of these statements are of course true of the individual skull, but on an 
average or in type cranial contours they are nearly true as the reader will observe 
if he examines the Moriori type contours. 


If we put aside the glabella as an indefinite “point,” and the inion as not 
available in our comparative contours, we are left with three available lines for 
superposition: (i) NB, the nasio-bregmatic line; (11) NA, the nasio-lambda line; 
and (iu) Ny, the nasio-gamma or standard horizontal. We will examine the 
comparison by each of these lines as superposition lines. Let us first, however. 
recall to mind the essential features of the Moriori skull as revealed by measure- 
ment, simple inspection and photograph. These are the flattening of the frontal 
region and the sagittal ridge or crest behind the apex. We superpose first the 
English typical sagittal contour (Fig. XXVII). Nasion on nasion and the lines 
Ny in the same direction. We have at once brought out the Moriori characters, 
the flattening of the frontal and the sagittal crest starting at the bregma or before, 

eaching behind the apex and extending towards the lambda. There is no question 
of these striking features. Rotate the tracing until NA coincides in direction with 
NX, all trace of the crest disappears, there is increased frontal flattening, but it is 
the English skull which now might be thought crested! Now make N8 coincide with 
Np. Here we get a really true measure of the flattening of the frontal bone, but 
practically the English and Moriori contours now coincide from bregma to lambda. 
In our opinion there is not the least doubt that comparison by the nasio-gamma 
line, not the nasio-lambda line, brings out the characteristic features of the Moriori 
cranium in a manner which fails entirely when we use the nasio-lambda line. 
Even in the question of the appreciation of frontal flattening, it is by no means 
certain that the flattening of the frontal bone is the sole source of the physiognomic 
impression; the rotation of the frontal bone as a whole (which would not alter 
the ratio of subtense to nasio-bregmatic chord (see p. 110)) may have equal physio- 
gnomic value. In the present case this physiognomic factor gives an aspect of 
flatness to the English skull, although there is a high value of the nasio-bregmatic 
frontal index. Anyone looking at the photographs in Macdonell’s memoirs on 
the English skull* would assert that there is physiognomically a flattened frontal. 
It is largely produced by rotation of the frontal bone and not by its flattening. 


* See Biometrika, Vol. v. p. 104, Plates XII, XVI, etc. 
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All these points are well brought out when we compare by the standard horizontal 
planes and have no suggestiveness at all when we superpose the nasio-lambda 
lines, which give the English skull a protrusion of the frontal both by rotation of 
NB and curving of the frontal bone, while actually the two factors act in opposed 
directions. 


Another point which tells considerably against the use of the nasio-lambda 
line as reference line in the case of two type contours is the experience that this 
superposition usually throws out very much the two basal triangles (i.e. nasion, 
basion, alveolar point). Asa matter of fact when the basal triangles are as nearly 
as possible superposed, then the nasio-bregma lines will be closest together, next 
the nasio-gamma lines and last the nasio-lambda lines, but sufficient material 
is not yet available to study properly this point, and accordingly no stress is laid 
on it here. 


If the Guanche tracing (Fig. XXIV) be superposed on the Moriori, N on N 
and Ny along Ny, we have the same result, the frontal flattening and the post- 
bregmatic crest; this is equally so if NB be set to NB, but the crest disappears 
entirely, if we set NA to NA. When we compare the Negro contour (Congo Bantu, 
Fig. [X) with the Moriori we have still, for Ny to Ny, the flattening of the frontal 
(NB almost coincides with NB) and the emphasis of the crest, but a new feature 
comes in, namely the dwarfing of the occipital portion of the Negro cranium. To 
a less marked extent this occurs with the Egyptian type contour (Fig. XVIII), 
but the emphatic flattening of the Moriori frontal bones appears reduced in this 
case by the physiognomic flattening of the Egyptian due to rotation of the frontal 
bone. 


From other aspects there are considerable objections to the use of NA as a 
reference line especially in the case of individual crania. While the suborbital and 
auricular points are capable of fairly accurate determination when the skull is 
on the craniophor and the bregma likewise as the coronal suture is not usually 
deeply serrated, the lambdoid suture is often most markedly serrated and the 
actual position of the lambda may be the result of a random serration, which 

appears to have little bearing on the average sweep of the lambdoid suture, and 
the intersection of that sweep with the sagittal suture. 


It is not possible to compare the constants of the type sagittal section of the 
Moriori as found for Schwalbe’s glabella-inion line with the like constants for the 
races dealt with in Benington’s paper, for the position of the inion has not been 
averaged off from his individual contours. We shall accordingly content ourselves 
with comparison of the indices obtained from bregmatic subtense and maximum 
subtense on the nasio-gamma, the nasio-lambda and Klaatsch’s glabella-inion 
lines as base, giving further the angles which the nasio-bregmatic or glabella- 
bregmatic line as the case may be makes with these base lines. We shall still 
have to confine our attention to male crania as the Benington paper has not provided 
the contours for female crania. 
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Table V results: 


TABLE V. 


Values of Constants of Sagittal Contours. 


A. Nasio-gamma Base Line. 


; | A (Ind. Max. 
Base Max. Index Index 

Line Subtense pp uneae Max. Sub. B Sub aa acae Breg- 
matic 
| Angle 

Race Bo y x y a y/Bo | x/Bo | y/Bo | 2/B, | y/Bo | «/B, 

: | Lee 
(i) (ii) (iii) (iv) (v) | (vi) | (wii) | (viii) | (ix) | (x) | (xi) J (xii) 
Eskimo ... 186-1 93-9 | 104-1 89:3 | 72-1 | 50-5 | 55:9 | 48:0 | 38:7 2-5 | 17-2 | 51°1 
Moriori ... 180-2 90-0 | 103-8 84:9 73:0 | 49-9 | 57-6 | 47-1 | 40-5 2-8 | 17-1 | 49°-3 
English ... | 183°5 84:9 | 103-7 83-0 77-2 | 46:3 | 56:5 | 45-2 | 42-1 1:0 | 14:4 |] 47°-1 
Guanche | 183-4 84-8 99:0 | 83:0 | 74:4 | 46-2 | 54-0 | 45-3 | 40-6 0-9 | 13:4 } 48°-1 
Egyptian * 181-9 84:4 | 106-2 82-5 | 79:0 | 46-4 | 59-5 | 45-4 | 43-4 1:0 | 16-1 | 46°-2 
Congo, Bantuf... | 175-6 85-4 97-7 83-5 70-6 | 48-6 | 55-6 | 47-6 | 40-2 1:0 | 15-4 | 49°-8 
B. Nasio-lambda Base Line. 
Eskimo ... eect Sosa: 71-2 89-5 712 |} 88:5 [38:9 | 48-9 | 38:9 | 48-3 0 0:6 | 38°-8 
Moriori ... | 180-0 66:0 | 100-2 66-0 91:0 | 36-7 | 55-7 | 36-7 , 50-6 0 5-1 | 35°-9 
English ... 181-2 68-7 90-0 68-7 90:0 | 37:9 | 49-7 | 37-9 | 49-7 0 0 37°-4 
Guanche... ... | 181-5 68-5 79-8 68-1 88:0 | 37:7 | 44:0 | 37:5 | 48-5 0-2 |-4:5 | 37°-7 
Kgyptian* ... | 180-1 70-2 | 85:2 70-0 89-6 | 39-0 | 46-8 | 38-9 | 49-8 0-1 |-3-0 | 38°-0 
Congo, Bantujy... | 176-3 69-8 | 90-1 69-8 84:0 | 39-6 | 51-1 | 39-6 | 47-6 0 3:5 | 39°°7 
C. Glabella-lambda Base Line. 
Eskimo ... ... | 184-5 | 66-0 | 100-4 65:2 | 87:1 | 35:8 | 54-4 | 35:3 | 47-2 0-5 | 7:2 | 36°-8 
Moriori ... ... | 181-0 61-2 | 102-0 60-6 88-7 | 33-8 | 56-4 | 33-5 | 49-0 0:3 7:4 | 34°:3 
English ... ... | 182-8 | 65-7 92-5 65-6 88-6 | 35-9 | 50-6 | 35-9 | 48-5 0 2-1 | 36°-5 
Guanche... vee | 183-2 | 64:6 82-3 64-6 82:3 | 35:3 | 44:9 | 35:3 | 44-9 0 0) 38°-1 
Egyptian* . | 180-9 66:8 91-5 66-8 88-9 | 36-9 | 50-6 | 36-9 | 49-1 0 1:5 | 36°-9 
Congo, Bantuf... | 176:6 64-7 91-7 64:3 80-9 | 36-6 | 51-9 | 36-4 | 45-8 0-2 6-1 | 38°-5 
| 


* XXVI—XXX Dynasties. 


{ Fernand Vaz, 1864 Series. 


In the above table columns (i)—(v) give absolute measurements, (vi)—(x1) 
provide indices and (xii) the “ bregmatic angle,” or angle the base line makes with 
the line joining nasion or glabella to bregma. 
measures in some manner the flattening of the frontal. 
of really determining the grade of primitiveness in the cranium. Other things 
being equal this angle will merely measure the extent to which the bregma is 
thrown back on the cranial vault, a relatively large frontal bone will in itself 
This is seen at once by comparing the relative 
position of the bregma as measured by the index x/B, for the bregma (column (ix)): 


produce a small bregmatic angle. 


It has been suggested that this angle 
It seems to us incapable 
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Nasio-gamma Base Line. 


Race Angle x/B y/B, Position as to size 
—| 
Egyptian... aco 46°-2 43-4 | 45-4 Fourth 
English tine ae 47°] 42-1 45-2 Second 
Guanche _... le 48°-1 40-6 45-3 Third 
Moriori Fes 400 49°-3 | 40-5 | 47-1 Fifth 
Bantu de S08 49°-8 40-2 | 47-6 Sixth 
Eskimo epeuseemders 51°-0 | 38-7 | 48-0 First 


We see at once from this table that the bregmatic angle has little if any relation 
to size of the skull. It gives absolutely the same order as the relative horizontal 
position of the bregma, and (except for the anomalous Egyptian, which is, however, 
greater than the English and Guanche by a slender difference) the same order as the 
bregmatic height difference. In biometric language we should say that the bregmatic 
angle is highly and negatively correlated with the horizontal position of the bregma, 
and is highly and positively correlated with the vertical position of the bregma. 
These results might be anticipated from its functional form tan-! (y/x), but we 
shall learn little as to racial order from the bregmatic angle. Indeed we might 
even state the more primitive the race the bigger the bregmatic angle. This 
conclusion is amply confirmed by the Cro-Magnon skull which gives: 

Angle x/By y/Bo 
55°-2 34-6 49-6 


Thus we must conclude that a high bregmatic angle is the result of a relatively 
forward bregma and is a sign of primitive rather than advanced cranial type. 
The reader must carefully distinguish between a “forward” bregma and a relatively 
large frontal bone. If we take as measure of the latter, the base Nf of the frontal 
bone in proportion to Ny, the order is: 


Congo, Bantu... 63-1 English We OIE) 
Egyptian O20 Cro-Magnon ... 61-0 
Moriori fe 62-1 Guanche wes 6029 
Eskimo... Giles) 


an order, to which it would be difficult to give any interpretation of value. 


When we turn to either the nasio-lambda or the glabella-lambda as base line 
we find none of the above identical order of vertical index, horizontal index and 
bregmatic angle. We do now find that the Moriori have a low instead of a high 
bregmatic angle, but if we were to associate this with a racially primitive condition, 
then the Congo Bantus would stand as the most advanced race in our list judged 
from either base line. As a matter of fact we see the explanation lies in the fact 
that the Moriori have from the nasio-lambda base the relatively least forward, and 
the Bantus the most forward position of the bregma, while the same is nearly 


110 A Study of the Crania of the Moriori 


true for the glabella-lambda base. The results found for these two base lines 
whether we consider the bregmatic angle or the horizontal or vertical bregmatic 
index are not in any accordance, and although these base lines must be used when 
the subject consists merely of a skull cap, or even lacks the nasion, we doubt 
whether for racial inter-relations they are nearly as valuable as the nasio-gamma 
base. The general measure of the flattening of the frontal bone is best read off 
as on our p. 95, and this gives the Moriori the premier position on the list. If we 
desire to measure the total physiognomic flatness we might, I think, combine 
the bregmatic angle for the nasio-gamma line with the angle the line joining the 
nasion to the are end of the nasio-bregmatic subtense makes with the nasio- 
bregmatic line. The following table brings out the contributions to frontal 
flattening due (i) to flattening of the frontal bone and (ii) rotation of the frontal 
bone as a whole. 


Race Frontal Bone Flatness tee ar of | ree Ct RE 
Cro-Magnon ... nds 24°-7 | 55°-2 79°-9 
Eskimo oma see 28°-0 51° 79°°3 
Bantu Be hie 27°°3 49°-8 77° 1 
Guanche wk. 26°-4 48°] 742-5 
English 7, sae 26°-6 47°°1 | 73°°7 
Moriori Sees ee: 22°-3 | 49°+3 | 71°-6 
Egyptian... Bee 24°-9 | 46°-2 algal 


This table seems of some interest. As we have before noted the physiognomic 
flatness consists of two factors; of these the rotation is roughly twice as important 
as the actual flattening of the frontal bone. Thus the English low physiognomic 
flatness is largely due to rotation; the Cro-Magnon skull with much frontal 
flattening is physiognomically at the top of the list owing to the excess of rotation, 
and in appearance has consequently a well-developed frontal. The Moriori 
stands at the bottom of the list for frontal bone flattening, but has less physio- 
gnomic flatness than the Egyptian, although the difference is very minute. It is 
clearly impossible to judge any grade of “primitiveness” by the mere smallness 
of the bregmatic angle, and we venture to hold that physiognomic flatness must 
in future be distinguished from flatness of frontal bone. 

Returning to our Table V, we note that in the case of the nasio-lambda 
base the maximum subtense for all practical purposes is the bregmatic subtense. 
In other words if the nasion, but not the lambda, be ascertainable on a skull cap 
a rough approximation can be made to the nasio-lambda line by drawing through 
the nasion a parallel to the tangent to the sagittal arc at the bregma. The reader 
will see by examining A Index «/B, column (xi), Table V, that the maximum falls 
sometimes on one side, and sometimes on the other of the bregma, but its position. 
never more than half-a-dozen millimetres from the bregma, causes the two subtenses 
to be equal owing to the general flatness in the region of the bregma. The maximum 
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difference between the two subtenses is 0-2 for the Guanche type contour, a 
difference well within our probable error of determination. If we use the glabella- 
lambda base line very little difference can be made between the maximum and 
bregmatic subtenses, although the maximum subtense now always falls behind the 
bregma, but by something less than 8mm. Asa matter of fact if we set off on the 
contour ff anteriorly and fl posteriorly to bregma both equal to about 15 mm., then 
the parallel to fl will give approximately NX and the parallel to Bl the glabella- 
lambda line. While these constructions are merely approximate for type contours, 
they will be only very rough for individual crania, yet in default of better rules 
they will give some idea of the position of the A, if only the bregma and either the 
nasion or glabella are known. If we want the actual position of the lambda we 
can only note that the foot of the bregmatic subtense is at an average distance 
-491 of the nasio-lambda line from the nasion, and at an average distance -474 
from the glabella in the glabella-lambda line, but these distances vary much for 
individual races, and a fortiort will vary more for individual crania*. 

Now the point just discussed, i.e. the practical equality of the bregmatic and 
maximum subtenses for both the nasio-lambda and glabella-lambda cases, suggests 
how important it is to ‘use a base line which will determine the extent to which 
the vertex of the cranium rises above and recedes behind the bregma. For the 
rising of the parietals above the frontal bone, throwing the vertex far back, is 
one of the marked features of primitive man. Whatever may be said against the 
Frankfurt standard horizontal plane--and many of the criticisms arise from 
insufficient experience of its value—it can hardly be disputed that on the average, 
1.e. in type contours, it is a fair approximation to the “horizontal,” remembering 
that even for the same individual the “horizontal” is only itself a sort of average 
position. Some plane must be taken to express the marked physiognomic characters 
conveyed by the idea of a retreating vertex and an elevated vertex. This no 
nasio-lambda or glabella-lambda line can accomplish. As far as we can find no 
very definite statement as to how the vertex is to be determined has yet been 
acceptedt. We define the “apex” to be the point of the sagittal suture vertically 
above the auricular line when the skull is adjusted to the standard horizontal position 
onthe craniophor. The “vertex” is the highest point of the sagittal or medial plane 
when the skull is in the same position. It is the point of contact of the tangent 
to the sagittal contour drawn parallel to the nasio-gamma line. From this stand- 
point we are able to give quantitative measures to the elevation and recession of 
the vertex relative to the bregma. All we need do is to subtract the indices y/B 


* Thus we attempted to reconstruct the Cro-Magnon skull from a knowledge of glabella and bregma. 
A very poor value of \ was found from :474, but an excellent one when the skull was assumed to be 
Guanche. In any such reconstructions it is probably desirable to use any suggestion of racial affinity. 

+ See Quain’s Anatomy, Vol. 1v. Part 1. p. 124 (1915), for example, where the vertex is defined to 
be the “highest point of the arch of the sagittal section.”’ In the diagram, Fig. 104 of this section, 
the nasio-lambda line appears to slope downwards instead of upwards and the “vertex” for this 
position appears to be in front of the bregma. In Fig. 112 the nasio-lambda line is made nearly hori- 
zontal and this adds much to the prognathous physiognomic effect aimed at! 
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and «/B of the vertical and bregmatic subtenses. There results the following 
table: 


Race Elevation Recession 
Moriori AEE Bas ner 2°8 17-1 
Eskimo ane aS abe 2:5 17-2 
Egyptian, XX VI-XXX Dyn. 1-0 16-1 
Congo, Bantu Ber See 1-0 15-4 
English ste eae 33 1-0 14-4 
Guanche vas mae See 0-9 13-4 


The Cro-Magnon cranium stands between the Eskimo and Hgyptian with a 
recession of 17-2 and an elevation of 1-5. Owing to the flatness of the skull in 
the neighbourhood of the vertex, the values of the recession can only be looked 
upon as approximations. Far more racial determinations are desirable before 
conclusions can be drawn, but undoubtedly these measures of the elevation and 
recession of the vertex suffice to differentiate those races like Moriori and Eskimo 
in which a receding sagittal crest is a marked feature. 

As a last feature of the sagittal section we may consider the angle between 
the nasio-lambda line and the standard horizontal. We have found no case 
of the depression of this line below the nasio-gamma line; the racial elevation is 
always about 9° to 13°. We have: 


Race Slope of Nasio-lambda Line 
Moriori AOE Ase sac 13°-5 
Cro-Magnon ... go a 12°-6 
Eskimo Bes BA Ba6 12°-3 
English se a: aa 10°-5 
Guanche Ree sao aa 10°-3 
Congo, Ferdinand Vaz, 1864 9°-9 
Congo, Batetelu ce a 92°9 
Egyptian, XXVI-XXX Dyn. 8°-4 


Thus we again see the Moriori and Eskimo at the top. It might be thought 
accordingly that the elevation and recession of the vertex in Moriori and Eskimo 
was due to a mere tilt of the cranium. The reply to this is that there is an actual 
physical ridge, there being actual concavity in some cases of the parietals on 
either side of the sagittal suture. 


A useful piece of work would be to add the basion, the inion, the opisthion 
and the suborbital and auricular points to Crewdson Benington’s contours— 
they are given on his individual tracings; so that racial comparisons might be 
made of the lower portion of the cranium from his sagittal type contours. 


7. On Mean Values found from Type Contours. We now turn to the second 
problem relating to the type contours: How far can we be certain of mean values 
obtained from them? We may note several general points before we proceed to 
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Fic. III. Moriori Type Vertical Contour. ¢. (33 skulls.) 
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Fra. IV. Moriori Type Vertical Contour. 9. (21 skulls.) 


Fie. V. Moriori Type Horizontal Contour. g. (33 skulls.) 
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Fie. VI. Moriori Type Horizontal Contour. 92. (21 skulls.) 
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details. After all the values have been averaged, some error is almost invariably 
introduced in plotting, especially as the contours have to be somewhat thickened 
up for the engraver; thus readings should be made to the middle of the contour 
boundary. But even then with the greatest care the engraver seems to find it 
difficult to reproduce to exactly the same size; however strict the injunctions 
may be we rarely find the engraver’s final result to agree absolutely with the 
plotted contour. There may indeed be as much as a millimetre difference. In 
deducing new indices therefore it is well to read off the diagrams, disregarding 
the numbers placed upon them and take the ratio of lengths thus read. We will 
now turn to the type contours. Let us consider first the glabella-occipital length. 
We ought to be able to get this exactly from the sagittal mean sections, and 
approximately from the horizontal sections. There results by measurement: 


Mean glabella-occipital Length L. 


Males Females 
No. Value No. Value 
Direct Measurement... 35 186-9 22 178-6 
From Sagittal Contour ... 33 186-5 21 178-3 
From Horizontal Contour 33 185-4 21 177°3 


That the horizontal contour does not give the same result as the sagittal is to 
be anticipated, for the horizontal line through the glabella does not necessarily 
pass through the exact occipital point, but the measurement on the sagittal section 
should theoretically agree with direct measurement. Is the slight difference due 
in this case to the fact that one or two skulls could not be used for type contours ? 
Unfortunately the type crania in the earlier cases were not finished so fully in the 
occipital region as the present samples, but I give the following examples: 


Mean glabella-occipital Length L for Males. 


Race... English ecg aa Congo, Batetelu | Gaboon, Bantu 


} 
| 


Direct Measurement S08 || leat 186-9 100*, 184-1 50 
From Sagittal Contour... | 100 | 186-8 | 100 1844 | 41 
From Horizontal Contour | 100 185-9 100 | 183-5 41 


No. Value No. Value No. o | Va Value No. | Value 
| 
| 


177°8 50 179-5 
178-4 50 WES: 
177-5 50 | 177-6 


* The 100 3 crania to which direct measurement was applied were not absolutely the same as the 
100 selected for contouring. Fawcett gives 185-1, MacIver 184-5 for prehistoric Egyptians. 
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I think we may conclude from these results that : 


(i) It is needful to add about 1mm. to make the measurement on the 
horizontal contour agree with that on the sagittal contour. 


(ui) Unless the occipital and glabella are reconstructed with great accuracy, 
we may have an average error of something like 0-6 mm. in determining the 
glabella-occipital length from the sagittal section. 


Of course the maximum breadth B cannot be read off on the horizontal section ; 
the temporal breadth should be something greater than the minimum frontal 
breadth; it is for gs 95-5 and for 9s 92:3 against minimum frontal breadth, gs 95-3 
and 9s 94:0. I am unable to account for the defect of 1-7 in the females except 
by the difference of errors of reading with callipers and reading with a scale on 
the diagrams*. Probably it would be best in future contour drawings with the 
Klaatsch to mark off specially the temporal ridges, as one is apt in turning a sharp 
point on the contour to bring the side of the needle rather than its point into 
contact with the ridge, but I do not think this would lessen the dimensions; such 
a lessening might arise from shear of the plasticine on which the skull is bedded, 
if too great pressure were applied. Whatever the source of the error, the lesson 
must be to test each individual contour against the available individual measure- 
ment before proceeding to the type contour. 


I now pass to the transverse section. In drawing this we pass from “auricular 
point” to “auricular point,” the “auricular line” being defined as the tangent to 
the upper borders of the external meatus. We have been forced to the conclusion 
that when the skull rests on the Ranke craniophor, the “auricular line” corre- 
sponding to the top of the plugs (wedge type, not conical type) does not coincide 
with the auricular line determined by joining the upper borders of the external 
meati as marked on the transverse contour. The ear wedges have to be thrust 
some distance into the auditory meatust and thus their horizontal tops are 
considerably below the line joining the upper borders. They are of course not 
as low as the line joining the “centres” of the external meatus. Thus it comes 
about that we can get three auricular heights. The one from the centre of the 
auricular passages is that usually determined on the living, and by field workers on 
craniological material; the one given by the contour tracer is that which we ought 
to find in strict accordance with the Frankfurt Concordat, and the one actually 
found in laboratory work with a craniophor is only strictly in accordance with 
the Concordat, when the external meatus has an upper surface, actually horizontal 
and accurately passing through the uppermost point of the border of the auditory 
external meatus. 


We now turn to the type transverse contours and note in the light of the 
above remarks the auricular heights: 


* The cranium which it was not possible to contour had a large frontal breadth, but its omission 
only reduces the mean to 93-8 mm. 
+ In order that the skull may be stable on the craniophor. 
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Moriort Auricular Height OH. 


Male Female 
No. | Value No. Value 
Measured on Craniophor ae es 34 | 117-5 22 115-1 
Measured on Transverse Contour | 33 | 109-5 21 108-8 
Measured on Sagittal Contour sine oo 116-5 21 112-4 


Let us examine similar data for other races: 


Male Auricular Height OH. 


: at Egyptian, XXVIf qa.  p. ‘ 5. 
= Race... English “XXX Dyn. Congo, Batetelu | Gaboon, Bantu 


No. Value No. Value No. | Value No. Value 
| | 


Measured on Craniophor... | 135 114-6 7 100 113-6 | 47 113-9 2 ? 
Measured on Transverse 
Contour on soo || Oo) 111-5 § 100 110-2 | 41 112-0 | 50 113-6 


It will be seen that there is not much comparative material, but in all cases 
the craniophor auricular height is in excess of the transverse contour height. 
Dr Benington was more successful than I have been in making his type transverse 
contour approach the auricular height of the craniophor measurement*, but even 
he may be 2 to 5mm. in defect. My determinations from the sagittal contour are 
in better agreement with my craniophor measurements, but may be 2 to 3mm. 
divergent, while the transverse contour differs by 6 to 8mm. I attribute these 
big differences to the very different manner in which the auricular point is deter- 
mined (i) when the skull is on the craniophor, (ii) when it is orientated by auricular 
points and apex, (i) when it is orientated by nasion, bregma and lambda. In 
the latter case in particular there is no certainty that the auricular line is per- 
pendicular to the plane of the paper. In (ii) the auricular point is reached by 
a stile arriving from the apex, in (iii) by a stile adjusted from the basal side of 
the orientated skull, and there is far more personal equation in its determination 
than may be at once realised. It is clear, I think, that determinations of the 
auricular height from the transverse type contour are of small value. But I 
doubt very much whether the auricular height as found on the Ranke craniophor 
accords well with the definition of the Frankfurt concordat, or indeed whether 


* It is possible that the top of the auricular passage is less nearly horizontal in the case of the 
Moriori than in the races which Dr Benington dealt with. 
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the “auricular line” is capable of sufficiently close definition*. At any rate the 
transverse contours need clearer determination at their lower terminals. In further 
work it might be valuable to place upon them the projections of the basion, 
suborbital and alveolar points. 

I have not taken directly the interauricular breadth of my Moriori skulls. 
Read off from the transverse contours, I find: gs, 125:2, and 9s, 117-6, which 
agree fairly well with Scott’s measurements, namely gs, 126-8, and 9s, 118-5, 
although mine for both sexes are consistently less. 


I made an attempt to ascertain the circumferences from the contours and found : 


Male Female 
Circumference 

By Tape From Contour By Tape From Contour 
Transverse: Q BAG dee 316-1 300-1 (316-1) 310-0 297-3 (309-6) 
Horizontal: U ie 36 522-8 522-0 502-6 501-0 
Sagittal: 8, od are 126-0 125-7 121-4 121-6 
Sagittal: S, ao ee 122-4 122-9 117-4 117-4 
Sagittal: S83 el see 121-0 118-9 116-0 110-0 
Sagittal: S = 8S,+ S,+ 83 369-4 367-5 354-8 349-0 


After the contour measurements for Q are placed in brackets the results which 
would be reached by adding to the contour arc twice the difference in the auricular 
heights. The agreement is then very close, showing that the divergence is solely 
due to difference in position of the auricular line and not to shape of contour. 
The results on the whole are fairly close; the occipital arc is the worst, and I have 
possibly not accentuated sufficiently the ridges of the occipital and inionic regions f. 


I now give (Table VI) a number of other characters which can be read off the 
type sagittal contour with comparison of their measured values. On the whole 
the agreement is not bad. Indeed it is sufficiently good to suggest that with the 
experience now gained an improved technique would enable the sagittal type 
section to be directly used as a means of determining the customary racial 
characters of the skull, or that at all events this section would serve as a valuable 
control method for verifying the mean values determined by direct measurement 
and arithmetic. The measurements of greatest divergence are: 


(i) The foraminal length. The type contour exaggerates it by 2mm. about 
for men and 4mm. for women. But the draughtsman has plotted in the case 
of the female skull the opisthion with an abscissa of 44-6 instead of 46-6 mm. from 
the gamma. Clearly the error observed in the occipital arcs is largely due to this 
displacement of the opisthion 2mm. in the male and 4mm. (2 of which are by 


* It is proposed to follow this matter up by actually continuing the tracing of the transverse section 
into the auditory meatus to the limit of the Klaatsch tracer’s possibilities and comparing with the 
craniophor auricular heights. 

+ See, however, the remarks later on this point. 
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Moriori Crania, Typical Male Skull, No. 765%, N. facialis, 


Plate Ill 


Biometrika, Vol. XI, Parts | and II 


SUDO) “NT 


ON TEAS TPA 


eord dy, 


“VIURIQ TOMO] 


Biometrika, Vol. XI, Parts | and Il Plate IV 


Moriori Crania. Typical Male Skull, No. 765% N. occipitalis. 
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Moriori Crania. Typical Male Skull, No. 765". N. occipitalis, 
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Moriori Crania. Typical Male Skull, No. 765%. N. basalis. 
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Moriori Crania. Typical Female Skull, No. 763. N. verticals. 
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Moriori Crania. Typical Female Skull, No. 763. N. basalis. 
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Moriori Crani \ Sk 765 s 
a. Male Sk 359 rk t 
é Skull, No. 7655. Marked asymmetry of foramen magnum. WN. occipitalis. 


Biometrika, Vol. 


XI, Parts | and Il 


Plate XIV 


N. lateralis. 


5 


Male Skull, No. 76 


ranla, 


Moriori C 


' : bi F : i oe © & ; : 
A + < : - ' . 
. : : : ° 
: . 7 ‘ 
‘ ° 7 7 a ‘ 
‘ 5 ' : : 
A F Fi: 
‘ ' b 


Plate XV 


Biometrika, Vol. XI, Parts | and Il 


“S1pDAID] NT 


‘ropovreypo UT oipeydoooayyeq [eV 


yd1990 


OINJAS TVUOIOD FO 


“ON “TMS IVT 


SUBIL) TLOTLOTAL 


nar, 


Biometrika, Vol. Xl, Parts | and Il Plate XVI 


Moriori Crania. Male Skull, No. 762. Ossicles of the lambdoid suture. N. occipitalis. 
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Moriori Crania. Male Skull, No. 76527. Massive elabella. N. facialis. 
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Moriori Crania. Skull, No. 76517, with marked bilateral paraoccipital processes articulating with atlas. 
For a unilateral case: see Biometrika, Vol. v. p. 104, Plate X. 
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Moriori Crania. Male Skull, No. 765%, illustrating fracture of left ramus of mandible. 
N. facialis, showing asymmetry produced by the injury. 
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TABLE VI. Measurements from Sagittal Type Contours. 


Male Female 
Character =T 
By Cranial Average | By Type Contour | By Cranial Average | By Type Contour 
tac ba5 182-0 182-2 176-1 175:8 
Veins oi one 185-6 185-6 177-8 178-2 
17 eee 200 135-9 135-4 132-8 131-4 
IB... Son 105-6 106-0 102-3 1015 
GH... Bop 76-4 77-0 70-0 711 
GB ... 080 103-2 | 101-0 96-6 97-5 
fml.... asc al| 35:3 | 37-2 34-4 38-5 
Ocl ... ate 62-2 62-1 64-2 64-2 
Frontal Index 21-6 19-4 21-9 21-1 
INL Oe abs 64°-9 64°-8 65°°-7 66°-3 
Aes ie aes | 71°-8 | 71°-9 73°-6 72°°2 
UBL 3s oa6 | 43°-4 43°-2 40°-7 41°-5 
1 5 ee 500 84°-7 | 88°-0 84°-5 86°:8 


draughtsman’s error) in the female. I take it that in the measurement of the 
foramen with the dividers the tendency is to measure the inside border of the 
foramen, but with the Klaatsch contour tracer to stop just short of the edge. It 
seems to me reasonable to suppose that if the basion and opisthion were first 
marked off as points and the foraminal length then compared with actual measure- 
ment, we should insure not only the exactitude of the foraminal length, but of 
the occipital arc. 


(iu) The profile angle. Here the difference is 3°-3 in the males and 2°:3 in 
the females; this is serious. IJ take it to arise from a divergence in the deter- . 
mination of the alveolar point (i) with the sagittal contour vertical and using the 
goniometer, and (ii) with the sagittal contour horizontal and using the Klaatsch. 
But I must reserve the discussion of the point until further experiments have 
been made. The fundamental triangles are in quite good accordance, and 
accordingly the evil may lie in the determination of the auricular point, when 
the skull is off the craniophor. We note how closely the occipital indices 
deduced from the contours approach those obtained from the measurement of 
occipital arc and chord. Indices from the contours were taken as 7 = occipital 
subtense/occipital chord. Then if I be the occipital index as defined by 


Ss Sy 
8’, V 24 (Ss — S's)’ 


I= 100 
it is easy to deduce that 
| Ree 
T= 100 (56+ 5:)- 
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The occipital indices so far obtained are: 


Male Female 
English (Moorfields) ... 900 58-5 59-2 
Moriori. ... sae ais Bae 62-2 64:2 
Gaboon, Bantus An a 68-3 69-2) 4 
Congo, Batetelus ne 66 ea eae 67-05 Ga 


The smaller the index the smaller is the radius of curvature of the occipital 
arc in relation to its chord, i.e. the occipital is more rounded. The order shows 
that the English have most, the Negro least occipital development, and that 
the Moriori male stands nearer to the European than to the Negro. 

The Frontal Index is on another footing. If the nasio-bregmatic chord and 
arc be given, i.e. S’,; and S,, then 

Sy Sy 

Tare SaN 24(S)='S)) 
gives the frontal index in the form of the ratio of the radius of curvature of the 
frontal arc to the frontal chord. To obtain this index the distance from nasion 
to bregma along the chord should be measured with the callipers. Not having 
measured this, I took the values from the contours which should be extremely close 
to the measured values as the terminals are very definite points. The results were: 

Males, 68-7; Females, 67-9. 
If now we wish to replace the index by 7, = 100 x subtense/chord, we must 


deduce 7, from 
aa I, x I, 2 5) ) 
vee (To . Gia) i 


There resulted the values 21-6 and 21-9 to compare with the direct contour values 
of 2;, 19-4 and 21-1. The reason for the excess is clear: it is due to the exaggera- 
tion of the length of the frontal arc (treated as a circular arc) owing to the 
massive glabella, more divergent in the case of the male than the female*. On 
the other hand if we take English males, we find from are and chord measurements 
of the frontal J; = 70-1 leading to 7, = 21-0, which is in closer accord with 22:2, 
the direct value of the subtense to chord index, thus indicating that with less 
glabellar prominence, the results of the two methods would be more nearly equal 
and would more closely justify the assumption of the approximate circularity of 
the frontal arcy. On the whole if the frontal curvature is to be determined by 
direct measurement, i.e. not by contour drawing, it will be desirable to adopt a 
modified form of Merejkowsky’s simometer{. It would not be hard to devise, but 
it would have to be considerably larger, and the subtense would have to be 
obtained by contact—it would not necessarily be at the bisector of the chord. 


* Had we supposed the are to be catenary rather than circular the values (see Tables for Bio- 
metricians and Statisticians, p. 64) would have been 22-7 and 23-1, still more divergent. A parabolic 
are was found to be even less successful. 

{+ Tasmanian aborigines (Berry and Robertson, R. S. Hdin. Proc. Vol. xxx1. Table p. 60) give for 
combined sexes, from J; (64°91), i, = 22°33. 

{ Borometrika, Vol. vii. p. 317. 
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To sum up: the comparison we have made suffices to show that cranial type 
contours can be of great service; they are justified by the generally good con- 
cordance they give when tested against average measurements of the skull made 
in the ordinary manner. But if they are to serve for the purpose of accurate 
measurement and for more than a rough comparison of race with race, we think 
a considerably improved technique will be essential and if, as we believe, it is 
desirable to compare the contours of the individual crania step by step with their 
actual measurements, then the contour drawing will not relieve us from the usual 
craniometric processes, but at first serve only as a control. In particular the 
difficulties which have arisen in our own case centre especially round: (a) a better 
agreement in the position of the auricular point as determined (i) by the 
craniophor and (ii) on the transverse and sagittal contours; (b) in the better 
location of the opisthion and alveolar point on the sagittal contour. 


These do not seem substantial difficulties, but they need practice and caution. 
The basion can always be inserted from the nasio-basionic length and_basio- 
bregmatic lengths (0B and H); the opisthion from the foraminal length and the 
lambda opisthionic chord (fml and S’;). The alveolar point is then determined 
by the upper face height (@H’) and the profile length (GZ). To avoid the 
contour determination of the auricular point, we might mark the apex (A) of the 
individual skul! when on the craniophor as well as the gamma. Then on the type 
contour by drawing a line through the average apex perpendicular to the average 
Ny, and plotting off along this line the mean auricular height (OH), we should 
obtain an auricular point which would agree with the cranial measure ents. 
The parallel to Ny through the auricular point thus determined would give the 
standard horizontal plane, and the profile angles by contour and measurement 
should now coincide. 

A marked feature of the type Moriori sagittal contours for both male and 
female is evident on inspection—the auricular point is vertically above the basion. 
It is not true for individual crania, but it may be true for type crania (i.e. on the 
average) for other races than the Moriori. If so the vertical through the basion 
would give the apex and the auricular point could be determined from the mean 
auricular height, and control would be easier to establish. In general the basi- 
auricular line as an approximate or average vertical of the head deserves 
consideration *. 


* Tt is proposed shortly to observe skull by skull the difference between the contour and the direct 
measurements with a view to improving technique. The English male sagittal type contour when the 
basion and alveolar points were added as above suggested gave a profile angle of 86°-3, according 
reasonably with the measured angle 86°-1, but the same method applied to the Congo Batetelu 
cross-section only gave a profile angle of 81°-2 as against 82°-8 measured directly on the crania. The 
actual marking of apex and gamma when the skull is on the craniophor and the transfer of these 
marked points to the contour will undoubtedly aid in ascertaining what is needful to bring contour 
measurement and direct measurement systems into full unison. At the same time it must be 
remembered that the characters of a mean or type contour cannot be demonstrated theoretically to 
coincide absolutely with the means of characters not used in constructing the contour. 

9—2 
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8. On the Moriori Mandible. A difficulty arises in treating mandibles in 
general from the fact that they are usually very few in number, often separated 
from the skulls, and then difficult to sex, and further that as many writers neglect 
them altogether there are few data for comparative purposes*. The present 
material contains only eleven mandibles and of these one certainly did not belong 
to the skull with which it was associated. Four were female and seven were male. 
Scott has fourteen mandibles, twelve male and two female. In the case of the 
Maori he has 24 male and seven female mandibles. In the following table I have 
added Scott’s data to mine for the Moriori and compare results with Macdonell’s 
for English, Fawcett’s for Naqada Egyptians, Koganei’s for the Aino, and a few 
scattered results for Negro races. 


TABLE VII. Measurements of the Mandible. 


Race... Moriori Maori Aino 

Sex ... 3 Y jar 3 2 Sars || So g 3+ 9 
Bicondylar Width.. ... | 124-1 | 114-8 | 122-3 | 127-2 | 120-9 | 125-8 | 102-0) 95-6 | 99-5 
Bigonal Width ... 106-4 | 101-6 | 105-3 | 104:5| 97-3} 102-8) — — — 
Height at Middle Incisors 35:3 | 30:7] 34:2] 34:5| 29-7| 33-4] 33-8 | 32:6 | 33-4 
Mental Distance ... 51:6| 49-1] 51:0}; 50:5) 49-1} 50-2) — — — 
Total Numbers (round) ... 19 6 25 24 qu 31 60 35 95 


Race... English Gaboon, Bantus SNe Fuegian | Egyptian, Naqada 
Sex ... ore 3 2 | 3+9 3 eel S 2 eae 
Bicondylar Width.. 113-2 | 112-6 | 110-0 | 111-6 | 117-0 120-6 | 110-5 | 106-4 | 108-2 
Bigonal Width : 95-4 94:3] 90-1] 94-4 97-2 103-4 93-6 | 87-6} 90-4 
Height at Middle Incisors _ 30-8 32-7 | 30-5] 31-8 — 35-4 32-9 | 31-6} 32-2 
Mental Distance ... . 43-7 45-4} 44:3] 44-9 aa = 44-4 | 43-1] 43-7 
Total Numbers (round) ... 140 29 20 49 16 5 50 60 | 110 
Now these results, slender as are the numbers, seem of extreme interest. All 


comparison of the English and Aino mandible with that of the Moriori is now 
out of place. The Moriori, Maori and Fuegian are the only mandibles with a 
bicondylar width over 120mm. and a bigonal width over 100. But the most 


* The Laboratory has in hand a series of many hundred Egyptian mandibles on which a much 


fuller system of measurements has been made. 


from this material. 


It is hoped to reach more definite criteria for sexing 
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marked differentiation of the Moriori and Maori from those of other races is the 
distance apart of the foramina mentalia which is over 50 cms. It is a misfortune 
that we have not this distance for the Fuegian mandibles, in order that we might 
ascertain whether it is usual for heavy large mandibles to have a mental distance 
of these dimensions, or whether it is a special racial character in Moriori and 
Maori. Judged by the general massiveness of mandible the racial order is: 

Moriori and Maori—Fuegian—Northern Negro—English—Gaboon Bantu— 
Egyptian—Aino. 

Taken in order of depth of chin, we have: 

Fuegian—Moriori—Maori—Aino—Kgyptian—Gaboon Bantu—English. 


Lastly, if we class these races by the index 100 x bigonal width/bicondylar 
width, which to some extent measures the verticality of the ramus, we have 
the order: 

Moriori (86-1)—Fuegian (85-7)—Gaboon Bantu (84-6)—English (84-3)— 
Egyptian (83-6)—Northern Negro (83-1)—Maori (81-7)—Kaffir (81-0). 

These orders are not very illuminating except as grouping the primitive races 
of the southern hemisphere together by massiveness of mandible, and showing 
the Moriori with the most vertical, the Kaffir with the most sloping ramus. It is 
clear that two racial] types of skull may approach each other fairly closely, while 
there can at the same time be a wide difference between their mandibles. 


9. Description of and Remarks on the Photographic Plates of Moriort Crania. 
Plates II, III and IV show a typical Moriori male cranium, No. 765%. Plate IT 
gives the norma facialis and indicates the high nose, the massive glabella, the 
flattened frontal and receding temporals. Plate III, the norma lateralis, left profile, 
indicates not only the same last three points, but the post-bregmatic depression 
followed by the sagittal crest, and further the massive mandible with vertical 
ramus. Plate IV, the norma occipitalis, marks the pentagonal form of this aspect 
as well as the generally massive character of the Moriori skull. The apex of 
the “pentagon” is the result of the sagittal crest. 


Plates V and VI, No. 76519, give another typical male cranium. Plate V, 
the norma lateralis, right profile, shows well the characteristic frontal region 
and the marked sagittal crest, also general massiveness in the occipital, inion 
and mastoid regions. Plate VI, the norma occipitalis, brings out, in a still more 
marked manner than even Plate IV, the sagittal crest and indicates well the 
flattening of the parietals on either side of the sagittal suture. 


Plates VII and VIII are again photographs of a male skull, No. 765%5. Plate 
VII, the norma lateralis, left profile, is an endeavour to catch the marked temporal 
line and the recession of the temporal bone behind it, which gives the characteristic 
aspect to the horizontal contour. Plate VIII gives the norma basalis and indicates 
well the large and rotund character of the foramen magnum, although the fora- 
minal index of this cranium does not reach the mean value. The heavy inion 
tidge is to be noted. 
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Plate IX, No. 765, and Plate X, No. 765%, give the vorma lateralis, left 
and right profiles respectively, of typical female crania, We see the same 
features as in the male, the massive frontal, marked temporal lines, receding 
facies temporalis, and the elevation and recession of the vertex produced by the 
sagittal crest. These features are less emphasised than in the male, but are in 
striking contrast to the female characters of less primitive races. 


Plates XI and XII provide the norma verticalis and norma basalis of a typical 
female, No. 763. In Plate XI we see the pear-shaped vertical aspect*, and in 
Plate XII the characteristic palate, and the asymmetrical foramen. 


Plates XIII and XIV further illustrate in a male cranium, No. 765°, the racial 
characteristics. Plate XIII shows in the norma basalis a marked asymmetry of 
the foramen as well as the general massiveness of the base. Plate XIV, the 
norma lateralis, right profile, gives the usual primitive factors of massive frontal 
sinuses, receding forehead, sagittal crest and vertical ramus; but the vertex in 
this case while markedly elevated does not recede; as the sagittal contour of this 
cranium indicates it is practically at the apex. 

Plates XV and XVI give norma lateralis, left profile, and norma occipitalis 
of a male skull, No. 762. Besides all the usual racial features of which the 
depression of the facies temporalis is especially marked, we see a rarely occurring 
ossicle of the coronal suture, and a protrusion of the occipital almost amounting 
to bathrocephaly. Plate XVI indicates the source of this in the conglomeration 
of ossicles in the lambdoid suture. 


Plates XVII and XVIII reproduce another male skull, No. 765?’, with massive 
frontal, receding forehead, depressed facies temporalis and sagittal crest. Plate 
XVII, the norma lateralis, left profile, shows an epipterict. Plate XVIII, the norma 
facialis, indicates how much of the frontal bulge is really due to glabellar protrusion, 
and how little nasal resemblance the Moriori has to the Negro. 


We conclude this series of photographs with reproductions of two crania of 
some interest. Plate XIX, No. 765!’, shows a cranium with marked para- 
occipital processes articulating with the atlas. Macdonell has figured in Plate X 
of his memoir on the Liverpool Street English craniat a case of unilateral para- 
occipital process of a like character, the present Moriori instance is bilateral. 


Plates XX, XXI and XXII of a male skull, No. 7652, illustrate how without 
surgical aid recovery from a serious injury is possible in a primitive people. The 


* The pear-shaped cranium of Plate XI should be compared with the pear-shaped cranium Plate XI 
of Macdonell’s paper, Biometrika, Vol. v. p. 104. The reader will understand at once what we mean 
by the heavy temporal lines and the depression of the facies temporalis in the Moriori. Indeed, if the 
reader will compare the photographs of the norma verticalis in Macdonell’s two papers (Biometrika, 
Vol. mz. p. 216, and Vol. v. p. 104) on the English skull, with our Plate XI, he will appreciate why the 
Moriori horizontal type contour differs so essentially from the English horizontal type contour. 
I regret that I have not provided more Moriori photographs illustrating this essential difference. 
In the Moriori crania the norma verticalis shows the temporal ridges, in the English they rarely 
appear at all. 

+ The curved dental arch is also worthy of note. { Biometrika, Vol. v. p, 104. 


Biometrika, Vol. XI, Parts I and II 


REMARKS 


872 | Basilar synchondrosis. 3rd molars still in alveolus. Small epipterics on R. Infra-orbital suture meets 
dllary suture R. and L. Flattening on both sides of sagittal suture. Bulging of parietals. Flattening of 


92 th. Sutures open. Flattening of obelion. Sagittal crest. 


104°5 ondrosis. Sutures open. Malar marginal processes R. and L. Small ossicles in lambdoid suture. 
ures R. and L. Sagittal crest. Flattening of obelion. Ossicle near R. asterion. 


103 Sutures open. Infra-orbital sutures R. and L. Sagittal crest. Flattening of obelion. Narrow head with 
wrietals. 


too larsynchondrosis. 3rd molars stillin alveolus. Very asymmetric foramen. Sutures open. 


97'5als. 


Infra- 


Bulging parietals. 
Sutures open; remaining teeth good. Curious bony bridge on R. of foramen. Many wormians in lamb- 
e. Bulging parietals. Flattening of obelion. 
945ninence with highly receding forehead. Sagittal crest. Ossicle in coronal suture. Great complication of 
lambdoid suture. Infra-orbital suture meets malar maxillary suture R. and L. Asymmetric foramen. 
| ight torus palatinus. 
| 103 utures R. and L. Sutures open. No 3rd molars. Large shallow palate. Torus palatinus. Small ossicle in 
suture. Bulging glabella. Marked malar marginal processes. Marked inion ridge. Incisura crotaphitico- 
us on R. 
98 crest. Bulging parietals. Marked inion. Sagittal suture almost entirely obliterated, coronal suture 
obliterated. Bulging glabella. Marked malar marginal processes. 
104 Fluttening of obelion. Bulging parietals. Marked malar marginal process on R. 
g9'\stoids. Teeth good. Sagittal suture almost entirely obliterated. Lambdoid suture becoming obliterated. 
lar‘marginal processes R. and L. Ridge on parietals R. and L. Deep palate. 
103 forehead with bulging glabella. Asymmetric flattening of frontal bone. Bulging pavietals. Numerous 
in lambdoid suture. Symmetric flattening of parietals each side of lambdoid suture. 
d, especially face and base. Very flat head with very bulging parietals. Sutures very open. 


Large 


g8neets malar maxillary suture R. and L. Teeth very worn. Small epipterics R. and L. Sutures very open. 
vat R. pterion. Bulging at mid-sagittal. 


103ng, with slight depression. Coronal and sagittal sutures much obliterated. Considerable absorption of 
n. Heavy inion ridge. Mandible fractured. 
tozull. Sutures open. Teeth good, but bad abscess of bone round R. canine. Anterior supra-mastoid tubercles 
Bulging parietals. Flattening at obelion. ~ 


tohead. Infra-orbital suture meets malar maxillary suture on R. Sagittal suture considerably obliterated. 
inion. Marked malar marginal processes. Mid-sagittal crest. 
10/forehead, and high sagittal crest. Sutures open. Some absorption of alveolus. Anterior sagittal bulging. 
uar marginal process on R. Asymmetric foramen. ; 
gaead. Sagittal crest; bulging parietals with some depression. Bulging occipital. 


toobliterated. Infra-orbital meet malar maxillary sutures R. and L. Epipteric on R. 21 mm. long. Receding 
rulging parietals. Marked malar marginal processes R. and L. 


tola. Receding forehead with sagittal crest. Depression at obelion. Sutures very open. 
Infra-orbital suture on L. Ossicle at lambda (18 x 18). Small epipteric on R. 


toforehead. Infra-orbital suture on R. Teeth good. Sutures becoming obliterated at pterion and obelion. 
lambdoid suture. Heavy inion ridge. | 
creceding forehead. Curious high mid-sagittal crest. Bulging occipital with heavily marked inion ridge. 
soming obliterated. Ossicle at lambda. Some depression of parietals. 

tatures R. and L. Marked malar marginal process on R. Flattening at obelion. 


Some absorption 


Bulging parietals. Sutures 
lightly obliterated. Epipteric on R. (25 mm. long). 
Iature meets malar maxillary suture R. and L, Marked malar marginal processes. Bulging parietals; high 
lerest. Sutures open. Teeth worn. 


vial bulging. Teeth much worn. Curious ridge on occipital base. Flattening at obelion. Aural exostosis 
ped superciliary ridges. Low receding forehead. Infra-orbital sutures R. and L. Coronal suture open; 
ure becoming obliterated. Many ossicles at lambda, and in lambdoid suture. Occipital bulging. Heavy 
*y marked supra-mastoid tubercles R. and L. 

astoids. Flattening of obelion. 
s open. Small epipteric on R. 
ila. Receding forehead. 


ligh sagittal crest. Asymmetric occipital bulging. Marked malar marginal processes. Small epipteric R. and L. 
ia 


Very heavy skull. Receding forehead. Malar marginal processes R. and 


mbdoid suture. Teeth good. Para-occipitals, especially the left, are large. Two tubercles on the basi-occipital. 


TABLE ,OF MEASUREMENTS OF MORIORI SKULLS IN THE ROYAL COLLEGE OF SURGEONS’ MUSEUM . Biometrika, Vol. XI, Parts I and IL 


Capacity and Weight Lengths Ages : 
pan eees Bees: Rae ; Seas. se Face Nose Orbit Palate and Profile Foramen Mandible Tmdines 
ZS Se ee Se a Fe ai s 2 = Bs Angles 
C W. i F ie L B B | ] = as ae z = = ~~ Ee 2g 
- IN grs. bj 7 H OH LB Q Q’ Ss is S. i} 9’ U Q@H 
1 F & | GB J NH N. 0 al 9 : 
: saa AS SOT COU, OR Ob ey G Ce GE fml | fmb GH Oh We hy ee Oct | B/L ASL B/L A/L B/H |G H/GB\NB/NH ee ee C/G ie Ne AL BL 0, 0, Pk REMARKS 
d 407 170°5 | 170°5 | 170-5 | 134 89 | 12 : : : * : Se SEAS ES 
5 | 1105} 94°5 | 299 | 299 | 341 | 117 | 115 | 109 | 92°5 | 481 59 83:5 | 115? 47°56 2 : 
: 2! 47°5 29) -) 39 3 0? 22 2 872 5 c 9 ‘ 
4 35 34 4 4 2 7 35 29 d d d d d 62-1 78°59 73°31 78°54 73°27 | 107:20 71:06 52°85 87-40 87-50 69°52 83°33 64°°1 17°8 38°-1 T0°-2 27°09 88° Very young. Basilar synchondrosis. 3rd molars still in alveolus. Small epipterics on R. Infra-orbital suture meets 
: I ill suture R. and L. Flattening on both sides of sagittal suture. Bulging of ietals, Flatteni f 
1280 (1278) i: ere) ip || w7kS) | GS) || 10395} 89 132 IIo | Lor 295 | 295 | 352 | 119 | 122 | 111 | 98 493 68 94 126°5?| 52°5 25 43°5 425 37°5 36 45°5 48 36 92 35°5 32:5 | 108 a 3 6 6 ero | nape 2 . : Patil 4 bos sc aica 
d 608 180 | 184 | 184 | 136 95 | 132 | 4117 | 109 | 318 | 314 | 358 | 120 | 123 is < “ ee 21 4 672 | 7:72 | 75:00 | 76-70 | 75:00 | 102-27 | 72:34 | 47°05 | 85-17 | 85:35 | 75°00 | 91-26 | G2°0 | 76°9 | 4z1°r Orr 32°:0 | 86° Very good teeth. Sutures open. Flattening of obelion. Sagittal crest. 
3 23 | 115 | 1005 | 5122) 73 97 Taps | 56 25 45 45 x | 39 0 2 40 | To4- 37° I d 5 : 
BES ; 2 5 2 45 S75 3 q d d d 65°5 WSO 7274 73:92 71°74 | 103°03 75°10 44°64 86-28 86-22 76:62 82:57 66°-9 73-0 40°-1 129 Dap rit 86° Basilar synchondrosis. Sutures open. Malar marginal processes R. and L. Small ossicles in lambdoid suture. Intra- 
30 (1235) 605 169 .| 172 | 172 | 136 86:5 | x II I Y. 
g a 3 5 | 133 4 OL 300 | 300 | 349 | 120] 119 | 110 | 98 480 72 94 127 54:5 DD 41 40? 38 37°5 40 | 53 35 103 30 29'5 d d d d d Gop os a £ i orbital sutures R. and L. Sagittal crest. Flattening of obelion. Ossicle near R. asterion. 
g 1410 (1397) 505 17575 | 175 176 135 95 132 114-5 | 100-5 | 314 | 312 | 350 | 122 6 88 8 | ie Dia 79°07 77°44 79°97 77-44 | 102:Io 76:80 40°37 93°75 92°68 66-04 97:01 71°-8 66°-6 41°-6 14°4 27°°2 81° Teeth good. Sutures open. Infra-orbital sutures R. and L. Sagittal crest. Flattening of obelion. Narrow head with 
2 II7 | 111 | 100 500 7 . II 2 ° ; oR 6 ss : 5 bulgi ietals. 
Q 1420 (1425) e eee ryt |172 | 173 | 143 | 100 | x39:5| 119 | 103 | 325 | 326 2 e i i = ae z 30 # ae 232 2 Pia ¢ 33 123 325 95°5 3°°5 5° TD HIP 75°43 76°79 75°08 | 102-27 75°72 52:08 90-50 89:23 79°89 d 68°-2 73°°1 38°°7 89 29°-8 82° Mone Penh epiohondrosia 3rd molars still in alveolus. Very asymmetric foramen. Suturesopen. Bulging parictals. 
z . z SES |} We | Bak sets). || 29) 504 | 73°5 | 10 136 5595 | 23 43 ASM ESS 36 8 53 35? | 9775 5 o d : ; 
3 1360 (1368) Ge ues) 182-5 | 188 | 189-5 | 136 8 vay || wel Sha : ee ce e : a g ¢ g 65-0 | 83-14 | 8r-r0 | 82:66 | 80-63 | r02-5r | 69:34 | 41:59 | 90°50 | 8z-or | 66:28 | 85-95 | 64°-1 GES | aspen 13°2 29°-9 86° Bulging parietals. Sutures open; remaining teeth good. Curious bony bridge on R. of foramen. Many wormians in lamb- 
9°5 | 13 9 7 5| 9 95 | 299 | 372 | 12 | 129 | 122 | 99°57] 528 | 74 99°5 | 133 57 24 46 43 39 38 53 | 55 395 | 945 | 335 | 30 d d d d d 58-1 | 72:34 | 67-44 | 7x80 | 66-94 | 107-25 | 74-3711 42-25 | 87:50 | 8 ee | & OG 5 é 2 P Glover Guttahy Puts ie Pariewels: Plautonne oe phot: 
x : as : : é °Z Si, ze E 4°34 “02 BP y/ 40° 70°-2 45°°2 22: 32°-4 83° Glabellar prominence with highly receding forehead. Sagittal crest. Ossicle in coronal suture. Great complication of 
be ossicles in lambdoid suture. Infra-orbital suture meets malar maxillary suture R. and L. Asymmetric foramen. Large 
9? LEOO (1159) 670 172 |175 |275 | 133-5] 88 | 128-5] 114 || r0o | 314 | 306 | 346 | 115 | 123 | 108 | or 489 67 95 126 50 24:5 4t 42°5 34:5 35°55 52°5 56 38 103 33:5 30°5 d d d d d 6a eer ertis ras J i palate. Slight torus palatinus. 
f = E Ul ers 734 YE 73°48 | 103-81 | 70:53 | 48:60 | 83-33 | 84:39 | 68-20 | 90-91 | 73°:0 68°-3 38°-7 aby 26°-0 81° Infra-orbital sutures R. and L. Sutures open. No 3rd molars. Large shallow palate. Torus palatinus. Small ossicle in 
; lambdoid suture. Bulging glabella. Marked malar marginal processes. Marked inion ridge. Incisura crotaphitico- 
3 1545 (1539) E: oe Si) 185 188 189 | 146:5| 93 | 132°5| 116 | 102 310 | 312 | 377 | 133 | 124 | 120 | 99 530 82: 108 136 61 26 45 44°5 37°5 38:5 51 54 40 98 38 31 124 TaD naG 31 52°5 < 81 é: = es J 4 buccinatorus on R. r 
° eee ts 7 a oe - : | 2 2 59 U7 72°47 | 77-40 | 7OLO | 110-4 | 76-48 | 42-48 | 86°68 | 83:55 | 74:07 | 82-04 | 63° | 68%r | 48°8 | 14°9 | 33°9 | 83° High sagittal) crest. Bulging parietals. Marked inion. Sagittal suture almost entirely obliterated, coronal suture 
& ts (1368) ae aes a0 ae 143 9 142°5 BES) OT | S|) S27 || Si 128 | 130 | 118 | 103 519 | 715 |; 95 127 55) || 25 41 4E-5 | 36:5 || 36 48 51 38 O04 38 33-5 d d d d d Gee : 6. om 6 iu becoming obliterated. Bulging glabella. Marked malar marginal processes. 
2 77 2 187 | 140 | TOO | 129°5| 1175 | 107 | 315 | 318 | 360 | 125 | 120 | 115 | 97 517 | 735 | 98 135 60°5 | 24 43 445 | 37 38 46 47°5 | 41 99°5 | 36:5 | 31 d d d q 4 oe 77°30 zZ oH 77°30 z “QE | L00-49 | 75°05 | 46:73 | 87-26 | 89:26 | 74-70 | 87-63 | 68°-z 729-5 30°-4 10°+5 28°-9 | 83° Sutures open.| Fluttening of obelion. Bulging parietals. Marked malar marginal process on R. 
3 d d 176 | 183-5 | 183-5 | 133-5| 92 |135 | 114 | 108 | 299 | 206? 28 6 68 : ess 4 eT 9°51 74:87 9:14 | 108-27 75:05 39°80 85-65 85-81 85-89 85°39 69°°7 66°-3 44°-0 BROT 21°-3 89° Very large mastoids. Teeth good. Sagittal suture almost entirely obliterated. Lambdoid suture becoming obliterated. 
i ; Sir 2 290?) 351 106 | 117 | Lor 504 8 103 137'5 54 25 44 43°5 36 d 51 54 37 10 aR I H : est Marked malar marginal processes R. and L. Ridge on parietals R. and L. Deep palate. 
54 5 ; 3 3 375 | 385 q d d d d 63°8 | 72:64 | 73°67 | 72°64 | 73°67 | 98:59 | 66:02 | 46:30 d 81-82 |. 68:52 | 84-45 | 68°0 | 75°:0 | 377-0 82-0 | 29°-0 | 83° Very receding forehead with bulging glabella. Asymmetric flattening of frontal bone. Bulging pavietals. Numerous 
227 d d d d d d d d d d d 337 | 120 | 107 | 110 d 480 d d d d d d d d d d d d d 32 29 d d d d d d d d d i wormians in lambdoid suture. Symmetric flattening of parietals each side of lambdoid suture. 
2 See) i ea ee eee : ; ries 3 | ‘ : : 3 us d d d d d d 90°63 d d d d d od Very damaged, especially face and base. Very flat head with very bulging parietals. Sutures very open. 
lake, Ons) : He hs eal 5 |-toz | 313) || $tos\=g59h|m25, | aro | 115 | 99 | 504 | 76 | t0z | 129 || 57 | 27 | 445 || 4351] 35 | 35 | 46° | 50 | 3575} 98 | 3r | 30° |/rx85 |'tx6 -| tog || 35-5) | 46 | 63:6 || 78-4x | 75:00 75 7 ° ° ° ° 
4 Stee, es 75 shee es ee dae ee x | | 35°5 : 7°°4 75 THUD 7437 | 104°55 74°31 47°37 80-47 79°10 70°80 96°77 64°°5 7I°-0 44°°5 I7°0 27°5 88° Infra-orbital meets malar maxillary suture R. and L. Teeth very worn. Small epipterics R. and L. Sutures very open. 
] 4 les ie 5 BD) 3°5 | 150 3 330 | 330 | 364 | 135 120 | 109 | 97 532 80°5 96 136 59? 27 44 44 38 37°5 51 54 34°5 | 103-5 30'5 30°5 | 116 126 To4 32? 51 606 8r- Fe Sieg : ee A s ‘ Depression at R. pterion. Bulging at mid-sagittal. 
E : m, 895) . F : : ‘ | : o} 74 74 74 7411 | I10:29 3°75 45°76 84-38 86-36 64°25 99°34 64°°8 79-7 AUG | seg /SSit Dap 88° Parietal bulging, with slight depression. Coronal and sagittal sutures much obliterated. Considerable absorption of 
3 470 (1472) 1050 180 182 183 I50 92 14075 | 122 106 331 | 328 | 373 | 124 | 122 | 127 | 108 531 77 109 137 55 25 43°5 42°5 37 36 «| 55 58 42 102°5 23 32 AG Bor 8 ts : alveolar rim. Heavy inion ridge. Mandible fractured. 
(+ m, 1200) = pee A 39 : 128-5 | Io 345 | 52 62-1 | 82-42 | 77-08 | 81-97 | 76-66 | to6-9r | 70:82 | 45:62 | 84:50 | 85:07 | 72°66 | 95-78 | 65°°8 7o°'8 43°°4 Teta BD 82° Very heavy skull. Sutures open. ‘Teeth good, but bad abscess of bone round R. canine. Anterior supra-mastoid tubercles 
é I410 (1406) 712 178 182-5 | 183 142 102 131-5 | 114 IOL 310 | 310 | 369 | 131 | 125 | 113 | 96 528 77-5 | 1or-5 | 144 58-5 27 45 45 375 37 53 59 38 108 325 31 128? | 1320 an ’ 8 2 “s R. and L. Bulging parietals. Flattening at obelion. ~ 
Pi ee z (+ m, 849) ; ae 5 ° Sas 3 4 34 495 | 62:0 | 77:85 | 72:09 | 77°60 | 71-85 | 107-98 | 76:25 | 46-23 | 81-77 | 83-11 | 64-4x | 95°09 | 73°°3 | 63°4 | 43°73 WSUS || Arey || fle}? Receding forehead. Infra-orbital suture meets malar maxillary suture on R. Sagittal suture considerably obliterated, 
410 (1406) bs eee 179 180 182°5 | 13 92°5 | 133 II7 104°5 | 313 | 313 | 353 | 123 | 124 | 106 | 95 510 74 IOI 136 57 27 45 45 36 35 52 53°5 37°5 | 104-5 d 32 d con 103 37 49 Re rae 3:8 6 Eraser : ‘ eel ; : is a af Es : _ Projecting mion. Marked malar marginal processes. Mid-sagittal crest. ; ee : 
4 ores G iB 717) Serge coe . elt ae ee sm aie - - TPT 7 F329) 75°95 72-9 Buf; 73°00 47°37 Jo 12 80-00 69-81 d 69°°3 69°-5 AI°-2 10°-5 24°°7 86 Very receding forehead, and high sagittal crest. Sutures open. Some absorption of alveolus. Anterior sagittal bulging. 
7 $ 4°5 | 14 4© Ze 218 | 387 | 131 | 129 | 127 | 103 545 7 102 13 60°5 27 45 43 37°5 37°5 48 40°5 39 6 29 re ra Marked malar marginal process on R. Asymmetric foramen. 3 
3 1540 (1520) 882 184 | 188 | 190 | 3147 | TOL | 135:5| 119 | 107 | 330 | 327 | 376 | 225 | 124 | 127 | 105-5 | 538 | 75 IIL 143 57 26 40 46 35 34°5 225 50 46:5 ane 36 28-5 4 g p u | f 579 78 12°54 17 TOS pepe 76-47 | 44:95 | 87-21 | 82-80 | 79-19 | 8x-tr | 62°8 | 72°3 | 44°9 | I1%7 | 33>2 84° Receding forehead. Sagittal crest; bulging parietals with some depression. Bulging occipital. 
4 See) ‘6 ee Sete hess on (oe eae ie e s ST | PPR) || TOT 77°37 7X31 | 108-48 | 67:57 |- 45-61 7500 | 75°05 | 82-97 | 7861 | 64°-1 73°°5 42°°4 d d d Sutures much obliterated. Infra-orbital mect malar maxillary sutures R. and L. Epipteric on R. 21 mm. long. Receding 
i I4t Ze 3 315 | 355 | 221 | 120 | 113 | 95 505 7 I10-5 | 140-5 57 25°5 465 4 38 39 52 565 | 38 OO 5 28 ] | ; j F forehead, bulging parietals. Marked malar marginal processes R. and L. 
a5 ee , a Seale - Pal. Pel 35'5 q d ad aq | d 60:8 | 77°05 72-13 7733 7221 | 106-81 68:95 | 44:73 84:78 | 82:55 | 67-49 | 78:89 | 65°-0 71°3 43°°7 157 28°-0 87° Bulging glabella. Receding forehead with sagittal crest. Depression at obelion. Sutures very open. Some absorption 
9? 729 178-5 | 182 182-5 | 133 133 04 7 | 308 | 365 || 123°] rr | 13r | 105. | 506 | 72 LOL 132°5 54 By 45 43°5 34 34°5 49°5 52-5 38-5 | 100 R 31 1 a ad of alveolus. Infra-orbital suture on L. Ossicle at lambda (18 x 18). Small epipteric on R. 
ee 3 : ‘ ze Z | | : Se) a a d d d 571 73°08 73:08 750%: 7291 | 100-00 | 70:15 | 50:00 7912 75°56 | 73:08 | 90:08 | 66°-6 72-8 40°-6 rep 29°°4 84° Low receding forehead. Infra-orbital suture on R. Teeth good. Sutures becoming obliterated at pterion and obelion. 
= ee : ee p Z ; : stated a 
3 97? 185 189 190 134 9 137 5 IIo 99 | 300 | 375 | 125 | 134 | 110 | 95 520 | 67 102°5 | 139 54°5 26 43°5 43? 40°5 39 d d | 37 98 34:5 30°5 d d d do le @ 58-6 70-90 72°49 Poe Seles On aS 5B ane : 7 F BPE x os a % Ossicles in lambdoid suture. Heavy inion ridge. : ; rey ; ; hay ; 
: ee | fOD3 7 97 5°49 47 9 93°02 ) 71°-6 39 140-4 25-2 86 Very narrow receding forehead. Curious high mid-sagittal crest. Bulging occipital with heavily marked inion ridge. 
Qo? 1230 (1244) 667 173 177 178 139:5 | 94:5 | 133°5 | 115 103 317 | 316 | 346) |)£20 | rr2 | 114 | 98 498 | 66 105 131 52 27-5 43 43 38 37 48 50 | 39 Too 3675 3075 d a d d d an 78:81 eG 8. : z 62.8 3 ss P Sutures becoming obliterated. Ossicle at lambda. Some depression of parietals. ; : : 
73°3 ToS 75:02 | 104°57 2°86 | 52°50 | 86:05 88-37 | 78:00 | 83:33 | 68°-1 73°8 38°-1 L402 23°90 88° Infra-orbital sutures R. and L. Marked malar marginal processon R. Flattening at obelion. Bulging parietals. Sutures 
3? 1364 (1306) 647 175 | 181 | 182 | 140 OR || wa || uals || sey || SAO || Ba || ou | Bey | weap || weap | wes 513 75 108 144°5 58 25 46 46°5 37 36 I s 47 IOL- y _ becoming slightly obliterated. Epipteric on R. (25 mm. long). 
5 54 47 5 | 34 29 d d d HN 8 ne sain ; oa 5 e 9 6 Io 5 
: , i : : . ‘ : | 4°9 77°35 74°59 79°92 7418 | 103-70 | 69-44 | 43°10 77°43 80-00 87:36 | 85:88 64°:8 73°-0 42°-2 11°-0 31°-2 84° Tnfra-orbital suture meets malar maxillary suture R. and L, Marked malar marginal processes. Bulging parictals; high 
C) 1445 (1425 25 182 187 187 143 97 132 ir TO! 31 320 | 370/225 | 123 || 122 | 102-5 | 532 0-5 | Log 143 62 2 4 44°5 7 ‘ 52 5 2 | mid-sagittal crest. Sutures open. Teeth worn. 
ate fe 2 RY 3/ ot ue oa f°. 102 35 3t a a d a | d 615 78°47 79°59 | 76:47 | 70°59 | 108-33 | 77:21 | 40:19 | 83°15 84:09 | 76°53 88:57 | ~ 64°-3 7O°-1 45°-6 Goad 28°-2 87°-5 | Anterior eeiial inline, Tooth much worn, Curious ridge on occipital base. Flattening at obelion. Aural exostosis 
3 1420 (1430) 670 180 186 187 148 2am RLS 5 II5 103-5 | 318 | 318 ; 368 | 124 | rrr | 133 | 110-5 | 522 | 83 108 144 62 26 475 46 39 38 48 BR fe Io 35 29 d d d d d 6 : a ; on the L. 
= 59 79°57 7247 79:14 71-01 | 109-79 7793 41-94 82-61 82-11 76:00 82-86 64°-0 67°:8 48°-2 13°°2 35°70 8r° Heavily developed superciliary ridges. Low receding forehead. Infra-orbital sutures R. and L. Coronal suture open; 
; 3 | sagittal suture becoming obliterated. Many ossicles at lambda, and in lambdoid suture. Occipital bulging. Heavy 
1318 (1335) 845 177) 181 ISI 140 92 137 118 99°5 | 327 | 320 | 365 | 126 | 120 | L19 | 102 507 | Ti 104 127-5 5555 24°5 415 42°5 355 36 46 49 35 06 37 209 d d d d d 6x6 eee PG i. | inion. Very marked supra-mastoid tubercles R. and L. 
2 A otal en 3 77°35 |. 75°69 | 77°35 | 75:69 | 102-19 | 74:04 | 43-06 | 84:63 | 85-95 | 71-02 | 78:80 | 64°2 | 69%4 | 46°4 iP) || APES |) BP Very heavy mastoids. Flattening of obelion. Very heavy skull. Receding forehead. Malar marginal processes R. and 
| y OK ¥ y g 8 Pp 
78 (1366) 635 ce 183 14 5 ms 93 5 1335 ee = 285 oa se er lessees os aoe Ue  beS 33 Be — iB 33 38 ie mS Bre 22> ||. 2 38 d g w d | a 62-7, | 73:66 | 75:00 | 72-04 | 74:28 | 82x | 78-22 | 45-09 | 81-40 | 81-82 | 65-78 | 83-79 | 637-8 To | 45°:2 62-2 : 87° Bulg erabel gee nee Eee je 
5 7 z i 43 . : 2 3 2 2 52 45 3 5 2 43 43 3 3 4 49 33-5 b d ots 2. : = g 2 Sees 4) 5 102 5 i é 7i- a 16-2 29:0 7 wiging glabella. eceding forehead. 
i | > ae eo 38 g Z a @ 63-4 | 7814 | 72-95 | 77°50 | 72:28 | to7-1r | 75:25 | 44°83 | 88-11 | 87-96 | 78-16 | 94-29 | 63°0 | 72°0 | 45%0 | 12°%0 | 33°%0 | 84° Very marked high sagittal crest. Asymmetric occipital bulging. Marked malar marginal processes. Small epipteric R. and L. 
| | | : | | | | Complex lambdoid suture. Teeth good. Para-occipitals, especially the left, are large. ‘Two tubercles on the basi-occipital. 
| | | | | | | 2 gs Pp P y 8 
! i 


The measurements are tabulated to the nearest 0°5 millimetres; the indices are given for the full measurements, using Ernst’s Tables. 
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Biometrika, Vol. XI, Parts I and II 


REMARKS 


ltals R. and L. Sutures open. Teeth worn. Palatine bridge on R. Marked occipital ridge. Ossicle at lambda. 
lepipteric on L. Marked malar marginal processes. 

tal suture on L. Marked malar marginal processes. Small epipteric on R. Open complex sutures. Small 
le on occipital base. Slight depression at obelion. 

ry skull. Infra-orbital suture on L. meets malar maxillary suture. Coronal and sagittal sutures much obliterated. 
ossicles in lambdoid suture. Marked malar marginal process on R. Depression at obelion. Very small mastoids. 
m. Sutures considerably obliterated. Marked malar marginal processes. Depression at obelion. 

ialar marginal processes. Receding forehead. Sagittal suture considerably obliterated. Bulging occipital. 
good. Small epipteric on R. Bulging parietals. Ossicle in lambdoid suture. 

[tal suture on L. Slightly bulging glabella. Coronal suture becoming slightly obliterated. Inion very pronounced. 
absorption of alveolus. Sagittal crest marked. Flattening of obelion. 

forehead. Symmetrical flattening on either side of sagittal suture. Flattening of obelion. Sutures open. Teeth 
| Some parietal bulging. 

ding forehead. Sutures open. Heavy inion ridge. Anterior supra-mastoid tubercles R. and L. 

vy skull. Sagittal suture almost entirely obliterated; coronal and lambdoid suture becoming obliterated. Marked 
idge. Flattening of obelion. Teeth good but worn. Deep palate. 

ling forehead. Infra-orbital sutures R. and L. Anterior supra-mastoid tubercles R. and L. LEpipterics R. and L. 
palate; worn teeth. Group of ossicles at lambda. 

y receding forehead. Considerable parietal absorption. Coronal and sagittal sutures much obliterated. Large 
ids. Bulging occipital. Teeth very worn. 

malar marginal processes. Sutures open. Ossicle at lambda. Anterior supra-mastoid tubercles R. and L. 
ig occipital. Depression at obelion. 

y receding forehead. Sagittal suture much obliterated. Teeth worn. 

suture entirely obliterated. Marked mid-sagittal crest. Coronal suture fairly open, lambdoid suture much 
rated. Epipteric on R. Anterior supra-mastoid tubercles R. and L. Marked inion. 

ry worn. Sutures slightly obliterated. 

ital suture meets malar maxillary suture R. and L. Slight post-coronal constriction. Sutures open. Marked 
| marginal processes, Slight asymmetry of foramen magnum. 

pen. 

pa Infra-orbital suture meets malar maxillary suture R. and L. Depression at pterion on R. 

eae good. No 3rd molars. Suturesopen. Flattening at obelion. Epipterics R. and L. Palatine bridge on L. 
st-coronal constriction. Bulging glabella. Sutures open. Infra-orbital suture on L. Basilar synchondrosis. 
of skull extremely asymmetric. Mid-sagittal crest. Third occipital articular surface. 

a suture meets malar maxillary suture R. and L. Small epipteric on L. Sutures open. Small ossicles in 


1, lambdoid and sagittal sutures. Bulging occipita . . 

ery open. 3rd molars not fully developed. Epipterics R. and L., large on L. (36°8 x 11-0 mm.). Ossicle on R. 

nbda (25 x 26-5mm.). Bulging occipital. 

ng. Basilar synchondrosis. Infra-orbital suture meets malar maxillary suture on R.; not on L. Malar marginal 

sses R. and L. Small ossicles in lambdoid suture. 

Basilar synchondrosis. 3rd molar on L. not yet erupted. Small epipteric on R. Flattening of obelion, 

ling forehead. Bulging parietals. Sutures open. Some bulging of occipital. 

ing. Basilarsynchondrosis. Some of 2nd teeth not yet erupted, stillin alveolus. Surface of bone much mutilated. 

epipteric on R. and L. Many ossicles in lambdoid suture. Ossicles very open. Infra-orbital sutures R. and L. 

siderably damaged. Sutures very obliterated. 

damaged. Sutures very open. : 

ing forehead. Bulging and projecting parietals. Markedly good set of teeth in both lower and upper jaws. 

aoe processes. Coronal and sagittal sutures considerably obliterated. Occipital bone asymmetric. 

g glabella. 

ecoming obliterated. Parietal depression. Teeth extremely worn. Some absorption of alveolus. Bad abscess 
e on L. maxillary. Mid-sagittal projection. 

[nd coronal suture much obliterated. Teeth very worn. Low receding forehead. Bulging parietals. Marked inion. 


pital suture meets malar maxillary suture R. and L. Coronal and sagittal sutures becoming obliterated. Some 
ping of obelion. Marked malar marginal processes. Teeth extremely worn and decayed. 
'Yy worn; sutures becoming slightly obliterated. Infra-orbital suture meets malar maxillary suture on L. Marked 


f marginal processes. Slight bulging at mid-sagittal. Bulging parietals. 
icle on L. occipital, 51 x 31 mm. Heavy projecting inion. Small epipteric on L. Sutures becoming obliterated 


rion; and obelion. Teeth very worn. 


ne 


Lengths 
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Capacity and Weight 
is y mes 8 = Arcs Face Nose Orbit Palate and Profile Foramen Mandible Indices Angles 
am ~ Aw — Xx = ee OF a ~ A. Se —— ia > 23 a im 
No. Sex (Gj W. in ers. | F L’ L B B H 0. y p 4 
Hs | oO |e | © 1 || se | w | er | ea) 7 | |e lon lon lon lon |e peal @& | en | fe fo len | a |e | | | en | aye | ge | tym | eye | are | ore evens) OP | Se chi, a ye | Ae |) Be 8, 6, PL REMARKS 
765" 3 1440 (1444) 785 185 187 188 1425 | 90 140 I19 111 22 22 6 | 126 2 8 : co ; | é < ¥: 5 5 
: SR | Sie | SPO Tee Gee iesie || eter) a g 985 | 138 61 25°5 47 43°5 37 37 SED 50 36 106°5 30°5 29 d d d d d 50:9 76-09 74:87 75°73 74:49 | IOr-64 81-01 41-94 85°51 79°14 64:29 95°70 65°-2 715 43°3 18°-8 24°°5 go° Infra-orbitals R. and L. Sutures open. Teeth worn. Palatine bridge on R. Marked occipital ridge. Ossicle at lambda. 
765° g 1440 (1425) 605 182°5 | 185 | 186 | 139°5| 99 | 132°5 | 11 108 I 8 | 12 5 5 Small epipteric on L. Marked malar marginal processes. eka 
‘ SES SLB Rag | 127)| Tt Sa | TECH 03 57? a 793 133 55°5 28) 44 45 3455 34 32 545 40 107 365 32 d d d d d 68-7 75°41 71°56 75:00 7E-18 | 105-36 68-93 51:89 79°53 7795 73°40 87-91 7O°-2 7223 38°°5 10°-7 27°78 82° Infra-orbital suture on L. Marked malar marginal processes. Small epipteric on R. Open complex sutures, Small 
7657 3? 1390 (1404) 899 183 | 184 | 185 | 146 96 | 140°5 | 120 6: 2 tubercle on occipital base. Slight depression at obelion. ; j : 
: : : PEN SHEN SS | SI Wea See) eect es) 526 74 190 740 55 23 ait 43 38 38 3x9 52 36 Ior 33 31 d d d d d 59°7 79°35 76:36 78-92 75:95 | 103-91 73°85 41°82 87-50 86-66 64:49 93:94 65°-0 73°3 Ale 10°-7 31°-0 84° Very heavy skull. Infra-orbital suture on L. meets malar maxillary suture. Coronal and sagittal sutures much obliterated. 
7652 oO? d 630 180°5 | 185 185 139 102 127 118 103 317 | 317 | 363 | 123 | 12 117 | 93 517 70°5 04 7 6 6 i * ane, eee in Jam pacigsuaUe, pee mela nonae prceees on a DepEeslons at obelion. ey ad mastoids. 
22 I 3 2 5 ; = : : 3D | S25 | 2 43 43 3 35 47 49 4I 96 d 27 d d d d d 56-6 5-1 68-65 I 68-65 | 109-45 78 9:42 | 81-40 | 83:72 | 83-26 d 64°-0 Pg | 41%-2 15°r 26°-0 | 90° eeth worn. Sutures considerably obliterated. Marked malar marginal processes. Depression at obelion. ig 
765 3 T1510 (1525) 734 186 | 189 | 19r | 140 93 |142 | 119 | 112 | 311 | 321 | 370 | 126 | 121 | 123 | 109 528 78:5 | Log 137°5 | 58-5 29 46 46 38:5 375 | 525 | 50:5 || 38 107 40°5 32 a d d d d 67-9 on Agnes one ee eee Tithe ine 81-08 84:06 | 67:14 | 79:02 | 65°5 Bee 42°0 10°5 31°°5 84° Heavy malar marginal processes. Receding forehead. Sagittal suture eoasageally obliterated. Bulging occipital. 
7652 2 I218 (1216) 654 178 179 18r 133 97 132°5 | 113 100 297 | 300 | 358 | 123 | 119 | 116 | 96 508 70 Ose) | 262 3 e Se 6 ‘ . : ‘ - . Teeth good. Small epipteric on R. Bulging parietals. iDanale ae lamb a ee eee aa 
F 4 3? Ze AES) 42 35 3 47 ao 35°5 97 37 30 d d d d d 59°4 74:30 73°96 73°48 73°14 | 100-45 73:22 53°85 86-11 82-36 70:60 81:08 67°-0 71-2 41°-8 r1°-8 30°-0 83 Infra-orbital suture on L. Slightly bulging glabella. Coronal suture becoming slightly obliterated. Inion very pronounced, 
765% Qe? 1440 (1425) 662 18I 182-5 | 185 144 95 138 117 | ror I I 64 | Ir r20 | tr 5 I 3 Some absorption of alveolus. Sagittal crest. marked. Flattening of obelion. ; 
3T3 | 373 | 304 TS LANES. au 7 99 ESE 32 265 42 415 33°5 33°5 48 oe 39 Tor5 36°5 31 d d d d d 62°4. 8-85 75°5 8. 59 | 104: I- 53°00 80-48 79°90 -00 85: 69°-9 69°: 41°-0 I0°-9 30°°1 80° Receding forehead. Symmetrical flattening on either side of sagittal suture. FPlattening of obelion. Sutures open. Teeth 
7 || TREE || TAPS 3 | PSY 75 5°39 a 8 ae Bala 
Tepe To80 Rae 384 88 me ion eg z oe, eel a |e |, O28 Basa eel eee Eee Bs JOD | Oe 143 a7 20 43 45 37 36'5 49 545 | 42 100 41 33°5 d d d d d 60-8 74:87 75°40 73°68 74:21 99:29 73°55 46-96 81-02 85:58 76-69 81-71 63°°4 73°:3 43°°3 9°°7 33°°6 83° We ealing torctioadi) Sutearonent Heavy inion ridge. Anterior supra-mastoid tubercles R. and L. 
ue é ere) 2 Sag bese) a NSD | es || Oey | SA || S324, | SSO aso | seas | aslo | ai 542 | 805 | 102 1355 | 63°5 | 24 44 44 38 38 5° 52°5 | 37 96 33 30 d d d d d 61-4 | 76-96 | 71-93 | 76-64 | 71-63 | 106-98 | 78-86 | 37-80 | 86:36 | 85:90 | 70:36 | go-91 | 60°4 | 72°4 47°-2 TAA) gzeeunltarsi7e Large heavy skull. Sagittal suture almost entirely obliterated; coronal ane lambdoid suture becoming obliterated. Marked 
7652? 3 1360 (1368) 830 176 182 183 144 QI 137-5 | 119 104 322 | 324 | 358 | 125 | 104 | 129 | 109-5 | 516 75°5 | 108-5 | 137 _ & a ae : E 3 ‘i inion ridge. Flattening of obelion. Teeth good but worn. Deep palate. Fe eer cis ue MOT nA Ea 
: i S 55 23) HPD) 40°5 39 39 515 53 41 105°5 40 32 d d d d d 61-6 79:01 75°54 78°57 75°13 | 104-58 69:44 41-82 84:2 84-01 77:65 80-00 70°-0 67°:8 42°2 16°-8 25°°4 85 Low receding forehead. Infra-orbital sutures R. and L. Anterior supra-masto1 tubercles R. and L. Epipterics R. and L. 
765°° 3 d 872 I9I | 196 | 197 | 142 94 | 1365] 117 | 108 | 311 | 312 | 378 | 126 | 122 | 130 | 103: I 3°5 | Io I 3 Large palate; worn teeth. Group of ossicles at lambda. a Ta OBI tomate 
3 35 | 353 7 5 af 59 26 47 46 37°5 38 oH 54°5 41 107 37 33 d d d d d 56:8 72°45 69-64 72:08 69:2 104-03 69:97 44:07 83-04 80-00 75°50 89-19 69°-o 71°-0 40°-0 13°-0 27°°0 84° Extremely receding forehead. Considerable parietal absorption. Coronal and sagittal sutures much obliterated. Large 
po 2 : 2: : R ; mastoids. Bulging occipital. Teeth very worn. : t ; 
769 3 d 904 191 192 193°5 | 150 99 ae 119 I1I-5 | 340 | 339 | 384 | 135 | 118 | 131 96°5 544 81 102 138°5 58°5 23 43 44 39 38°5 475 49 36 103 35°5 33°5 d d d d d 54:0 78-13 74-11 77°55 73°57 | 105-41 79°41 30°32 87:50 90-70 73°47 94-29 62°-2 73°°3 44°°3 13°05 30°-8 87° NONE inpererton yee a putures open. Ossicle at lambda. Anterior supra-mastoid tubercles R. and L. 
760° 3 d jot 177 | 183 | 184 | 137 97:5| 139 |120 |a1r2 | 324 | 323 | 352 | 113 | 132 | 106 | 93:5 | -508 | 73 99°5 | 14355 | 56:5 | 28 44 8. - ie : : a & PAN : : ' h 
j ; F 2 F : : 2 45 39 39 || 9 55 39°5 | 113 S35 || Ses d d d d d 67- 4°86 5:96 “46 Dy 98-56 y 50:09 | 85-61 | 87-78 it 93:94 2°-0 o°-0 | 38°:0 | 12° 2503 | 83 Extremely receding forehead. Sagittal suture much obliterated. Teeth worn. y : 
765" i) ZO40 (1654) 982 194 | 198°5| 199 | 150 | 104 | 137 | 120 | 109 | 333 | 333 | 392 | 132 | 145 | 114 | 98 555 77 100 139 58 24°5 43 44 38 a 49 De 405 | 10275 34 30 d d d d d con ie ae Li 6B84 aot fean 41-89 92°76 86.48 fate are b4%0 Pras Hemet en 3170 87° Sagittal suture entirely obliterated. Marked mid-sagittal crest. Coronal suture fairly open, lambdoid suture much 
765%2 9 ; 1300 (1292) 602 175°5 | 178 |179 | 139 93 |130 | 124 | 103 | 304 | 304 | 350 | 120 | 120 | 110 | 96 505 B 96 134 54 25 46 38 38 47 6 8 F F A F ae - ae F 5 5 5 ‘f OT ee Hpipseng Sr ie eee supEereaiend tubercles R. and L. Marked inion. 
178 5 2 76 2 44 39 93 sD || 2 d 5° +09 0 6 2-6 93° 6:2 : “90 aan 8-20 30 | 61%1 On, 3° g°o | 34°5 4°°5 eeth very worn. Sutures slightly obliterated. ; Rh 
765°° ® 1180 (1178) 505 168 | 170 | 170 | 135 92 | 131 | 112 | ror-5 | 300 | 300 | 340 | 115 | 116 | 109 | 94-5 | 486 69 96 wah 2 23 43 44 37°5 36'5 45 49 34 2 33 27 d d d d d Gre Dae ree ee ae eae ee ies aoe 86-97 bam sees 61°-7 Be ee . 9°:0 3223 86° Infra-orbital suture meets malar maxillary suture R. and L. Slight post-coronal constriction. Sutures open. Marked 
765% fe) 1360 (1349) 635 180 | 180 | 180 | 141 88 | 131-5 | 174 | rox 304 | 304 | 364 | 124 | 118 | 122 | 103-5 | 508 65 93 125 49 26 39 39 33 32°5 43°5 44°5 35°5 95 34 30°5 d d d d d 61-8 FSee apron BOD apn | topo CaS soy roy Bede se ee ase isa 38°-8 rs iO ae cutie, ee processes. Slight asymmetry of foramen magnum. 
765° z Leen 228) ae 1725 | 1735 | T75- | 133 89:5 ae poe GH | BHs | 2S | 312) | mul | wey | aay | ge) on 2 93 125 54 24 43 43 36 35 5° 53°5 | 34 92 35 29 d d d d d 615 | 76:66 | 73:54 | 76:00 | 73-48 | 104-23 | 75-27 44:81 | 80:93 | 83-72 | 63-177 | 82:86 | 63°4 | 73°4 | 43°12 82-5 | 34°°6 | 82° Sutures open. Infra-orbital suture meets malar maxillary suture R. and L. Depression at pterion on R. 
765" 2 eos (1273) x9) Ue |) ies 475 __ | 743 95 | 33 WE BES) 322))|| 322) 19345) | 115 || EXO || w14 | 98:5, |) 500 Te WS || X32 55 23°5 44°5 43 39°5 39 44°5 475 | 40 cope 2°5 2 d d d d d 63:8 82-18 78-16 81-71 77-71 | 105-14 | 73:99 42°36 90°70 89-16 84-22 84:21 58°°3 79°°4 42°°3 10°-6 Bias 90° Young. ‘Teeth good. No 3rd molars. Suturesopen. Flattening at obelion. Epipterics R. and L. Palatine bridge on L. 
765 e 1400 (1392) 632 173 | 1775 | 1775 | 1445 | 93 | 135 | 119 | 105-5.| 320 | 320 | 362 | T2r | 126 | 115 | 106 5r2 || 7 94 120 53 23°51) 4335) |) 44 SS |) 35 |) ZS) Do) 2 IOI 33 29°5 d d d d d 79:2 | 81-36 | 76-05 | 81-36 | 76-05 | 107-03 | 74:57 | 44:34 | 82:50 | 86-40 | 78-42 | 89:39 | 66°2 742-0) ||| 399-9 7-0 | 32°9 | 81° Slight post-coronal constriction. Bulging glabella. Sutures open. Infra-orbital suture on L. Basilar synchondrosis. 
38 i 6 3 : é Base of skull extremely asymmetric. Mid-sagittal crest. Third occipital articular surface. \ : 
765) Child | 1360 (1349) JOH} 173 E73 || BFS || BO GIO || USSD | 205) 92 312 | 312 | 352 | 119 | T2r |) 112 | 93 487 61 82 d 45 Zo 37 37 35 35 ye 44 30°5 89 34 29 d d d d d 59°7 81-01 75°23 81-01 75:23 | 107-69 74:20 44-41 94°59 94:05 68-77 85-79 67°-1 73°°4 39°°5 10 -I 29°°4 83°°5 eee suture eee maple gee eure R. and i Small epipteric on L. Sutures open. Small ossicles in 
5 coronal, lambdoid and sagittal sutures. ging occipita . : : 
765" | Young 9 d SEO 185 | 185 | 186 | 143 OH | wales | ug 96 | 309 | 310 | 367 | 124 | 106 | 137 | t1m-5 | 517 | OF 87°5 a 47 26 37 39 365 | 34 39 41 2 85 385 | 33 d d d d d 58-1 | 77-30 | 68:38 | 76-88 | 68-51 | t13-04 | 69:87 | 55:32 | 87-40 | 98-92 | 78:05 | 85-73 | 60°8 | 80°3 | 38°9 Oy || s2 || BP Sutures very open. 3rd molars not fully Sevolinedt Hpipterics R. and L., large on L. (36:8 x 11-0mm.). _ Ossicle on R, 
thi 5 of lambda (25 x 26:5 mm.). Bulging occipital. 3 
TOD Sia ora ee TS NEDSS) MSY SD Se aS SE sie P| SO sees eT | ae es PEIN 22D | SB) 37 oe 3? 385°] 41 gH || 80 ANS) 2 d d d d d 66-0 | 78:06 | 74:92 | 79-19 | 75:24 | 105-38 | 67-05 | 51-13 | 86-49 | 82:05 | 76:34 | 92:15 | 59°-0 | 82°3 | 387 8°-7 | 30°0 | 91° Very young. Basilar synchondrosis. Infra-orbital suture meets malar maxillary suture on R.; not on L. Malar marginal 
i ; processes R. and L. Small ossicles in lambdoid suture. acta : R 
765" Child | 1390 (1387) 485 nS me ee 93 | 725 Ea BIH || JW |) JO7 | SHO | HAG || HAO || sas | OH oF4 Ge 89 d ot 2A 40 39) 35 34 ANd] 49 345 94 35°5 32 d d d d d 60-6 79°12 68-79 78-69 68-41 | 115-01 76-96 47-24 87-18 87-18 70:00 90-65 65°-0 Tei 41°°3 10°-3 31°-0 84° Young. Basilar synchondrosis. 3rd molar on L. not yet erupted. Small epipteric on R. Flattening of obelion, 
i 8 } Receding forehead. Bulging parietals. Sutures open. Some bulging of occipital. : 
7657" Child d Awe 169 | 170 | 170 | 133 GO | 82 1i2 87 | 305 | 304 | 350 | 118 | 125 | 107 | 94 480?) 58 88-5 | 114 43 21 38°5 38 35 34 38 41 34 78 39 29 d d d d d 66:3 78-11 75:29 | 78-11 75°29 | 193:75 | 65-91 48:60 | 88:94 | 90:38 | 82-43 74:30 | 61°2 77°:8 41°-0 9°:2 31°:8 87° Very young. Basilar synchondrosis. Some of 2nd teeth not yet erupted, still in alyeolus. Surface of bone much mutilated. 
Small epipteric on R. and L. Many ossicles in lambdoid suture. Ossicles yery open. Infra-orbital sutures R. and L. 
7653 g?? d d 188 ; g 138 : d ues d d d d | 311? d 128 | 147 | d d d oe # d d A752?) 20 d 42 d 34 d d d d d d d d d d d d d d d d d d d d d d d d d d d d d Skull considerably damaged. Sutures very obliterated. 
765°" 9? d d 177? Bec 1375| 9 i d d d d i TAS | wan |) dl d d g d d d d d | d d d d d d d d d d d d d d d d 76:39 d d d d d d d d d d d d d d Skull very damaged. Sutures very open. , 5 : : 
765 3 370 (1387) 686 174 | 180 | 18r | 139 94 | 132-5 | 112 99°5 | 303 | 303 | 360 | 124 | 111 | 124 | 107-5 | 506 7 107 135 58°5 AED 46 45 37 36 5°°5 925 AZ 95°5 375 30 d 121 IIo 32 57 64:3 77:22 73°50 76:80 73°09 | 105:06 72°71 41°53 79°55 80-78 80-38 80-42 63°°8 69°°1 47-1 II°-9 35°°2 81° Very receding forehead. Bulging and projecting parictals. Markedly good set of teeth in both lower and upper jaws. 
: , Marked malar marginal processes. Coronal and sagittal sutures considerably obliterated. Occipital bone asymmetric. 
: Bulging glabella. i 
765"° 3 1340 (1330) TTS. LISP | aa ASS pes) CES | Oa | wee Sue || Gm | SOR sees ae | seg || els 5°9 HED || 28 age 57°5 ae 46 47 af) 395 | 48 497515) eS oz 36 34°5 | 117-5 | 123°5 | 104 345 50 65:2 76°80 | 77:35 75°96 | 76:50 | 99:29 | 76:21 45:22 | 83:82 86-96. | 66:73 | 95:27 | 57°8 75-1 47-1 9°-4 31°-7 84°:5 | Sutures becoming obliterated. Parietal depression. Teeth extremely worn. Some absorption of alveolus. Bad abscess 
CES) of bone on L. maxillary. Mid-sagittal projection. ; : F ee 
7657 T160 (1168) ( es. IPS || 72 | 2 | AS 93 | 120 |'x10 | rox-5)| 307 | 307 | 344 | 117 || 108!) T19)}| ror-5) | 492 76 utero) LAPS) || SH 22 Az 42 37 35 Ses) |) 58) 40 98 30 30 T19°5 | 117 102 31 515 62:2 79°07 73°26 | 78-01 72:83 | 107-94 | 76:00 | 38:94 | 82:46 | 88-10 | 75:00 | 100:00 | 65°-0 O° 44°90 12°-9 32°'0 83° Sagittal and coronal suture much obliterated. Teeth very worn. Low receding forehead. Bulging parietals. Marked inion. 
+m, 
7657° 9? 1310 (1311) 599. 173 |174 | 175 | 138 915] 136 | 116 | 105 | 315 | 315 | 351 | 122 | 111 | 118 | roo 497 67 HOO 135 52 25 45 45 34 33 485 | 52 37 98 35 30 1045 | 121 104 28 515 | 61-7 79°31 78:27 78-86 | 78-33 | 101-32 | 67-00 | 48-08 73°45 75°89 71-81 85-71 65°:0 76°:8 38°-2 IO°-2 28°-0 87° Tnfra-orbital suture meets malar maxillary suture R. and L. Coronal and sagittal sutures becoming obliterated. Some 
Cr eee) ; flattening of obelion. Marked malar marginal processes. Teeth extremely worn and decayed. 
765*° 1370 (1368) 814 183 | 187 | 189 | 140 93 | 14-5] 121 | 108 | 323 | 323 | 373 | 127.) 125 | 120 | Lo4 522 | 78 13 129 56 26 445 | 45:5 | 38 37 5° 53 36 102°5 | 34 29 d d d d d 63-4 | 74°87 | 75-72 | 74:07 | 74:9r | 98:87 | 75:87 | 46:43 | 80:96 | 85:40 | 67-92 | 85:88 | 64°5 | 72°2 | 43°3 Ss || ayo || Be Teeth very worn; sutures becoming slightly obliterated. Infra-orbital suture meets malar maxillary suture on L. Marked 
malar marginal processes. Slight bulging at mid-sagittal. Bulging parietals. — i : 
765" a) T460(1454) 20 187 res 189 PS MOSFD || 2S) #19. ne STONES NGM Rag 152) |1207) 130) 0413 53° 73 TO3 132 5t5 ee 44 44 36 36 48 ae 37 94 37°5 31 d d d d i 64-7 76:06 7181 75°66 71°43 | 105'92 71-06 43°10 81-44 81-82 72°55 83-10 63°.0 73°°0 44°:0 T1°-0 32°-0 85° Large ossicle on L. occipital, 51 x 31 mm. Heavy projecting inion. Small epipteric on L. Sutures becoming obliterated 
at pterion; and obelion. Teeth very worn. 


The measurements are tabulated to the nearest 0°5 millimetres; the 


indices are given for the full measurements, using Ernst’s Tables. 
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left ramus of the mandible has been broken right across about the level of the 
alveolar ridge, possibly sheared across by a chin or side blow. Plate XX shows 
the right profile with the ramus and right side of the corpus normal; the only 
very obvious defect being the shortening of the mandible relative to the upper 
jaw. Plate XXI, the norma facials, shows the asymmetry produced by the accident, 
the whole of the lower jaw is tilted. The source of this is explained on examining 
Plate XXII with the left profile; we see that the ramus has been broken right across, 
and both corpus and ramus on this side have ceased to develop properly. The 
posterior edge of the ramus is no longer nearly vertical as on the right profile, 
but slopes forward to a diminutive perigonal region. The whole mandible has, 
so to speak, been rotated round a vertical axis to allow for the dwarfing of the 
left side of the corpus, thus causing the upper jaw to protrude as seen in the 
profiles, and further has been rotated round a medial horizontal axis to allow for 
the dwarfing of the upper portion of the left ramus; the combination of these 
two rotations has given the remarkable skew effect produced in the norma facialis. 
The subject must have had a difficult time during life, but appears to have long 
survived his original trouble. 


10. Conclusions. While I am by no means fully satisfied with this study of 
the Moriori skull and see many ways in which, if I could restart this investigation, 
it might be substantially improved, yet I must plead that what I have done has 
been sufficiently laborious to occupy fully my year’s tenure of the Crewdson 
Benington Studentship. I can only say that experience has taught me certain 
difficulties which could be rectified should opportunity again offer, and that the 
detailed account I have given of these difficulties may aid others who plan cranial 
contour work. Further I hope to have provided an account of the Moriori skull, 
based on wider data than have hitherto been available; it will, I trust, in the 
future assist in settling racial affinities as more and more cranial material is 
reported on in a standardised form. If the racial affinities I find in the Moriori 
do not wholly coincide with the views of more authoritative anthropologists, 
they may at least serve as suggestions for the further examination of the primitive 
races who still or till recently bordered the Pacific Ocean. I feel confident that 
in the craniology of these races will be found a definite key to the evolution of 
man in prehistoric times; there have, I believe, been more extensive folk- 
wanderings, possibly rendered feasible by geological changes, than we have yet 
recognised. Craniological material from the outlying islands, and the fringes 
of the Pacific continents—if feasible of extinct races—is what one must above 
all things desire. 
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Note on the Probable Error of the Coefficient of Correlation in the 
Variate Difference Correlation Method. 


By A. RITCHIE-SCOTT, B.Sc. 


In Biometrika, Vol. x. p. 278, Dr Anderson of Petrograd has given the following expression 
for the probable error of the correlation coefficient between differences of the mth order. 


Probable error Of 1x = *67449 mopey 


ee 3 
= 67449 a | i [nm + 2(n-m-1)(-",) 
Jn — m n-m m+1 


m.m—-1 \3 m.m—-1.m-2 \? 
+2(n-m~2)(—2—) +2(n-m~3) (= SSS) + ete], 


where m = order of difference, 


n = size of original population (i.e. number of years in series of observations etc.). 


In this formula the population is supposed to be reduced by unity at each difference taken. 
It is, however, found more convenient in practice to keep the population constant by taking in 
an additional observation at each successive operation of taking the difference. If we denote 
this constant population by n’ the expression within the brackets reduces to 


wo[ m+ 2(n" ~ 1) Ll V+ 2(n’—2) (Se T+ 2(n! — 8) (Bo) tet | 


m+ m+1l.m+2 m+1.m+2.m+3 
1 ee 2 m.m-1 \2 
ee ard) er) 2 ere 
= ae 2 
si 1-3 es eae + ete. 
V]}\m+1.m+2.m+3 
m \* m.m-1 3 m.m-1l.m-2 \2 | 
Se ee ee |< Ga eee + etes} 
2 m \2 m.m-1 2 m.m-1.m—-2 \2 
on me m+l.m+2 a te. 
: = A(t) +2 (Geass) +8a ei wets) toto 


= 2p (m) - ha AL 


The series p (m) and f (m) have been summed as follows by Professor Pearson: 


we le m ae AS ae mom a Tat) ete 
Ph ela ra m+l.m+2 m+1l.m+2.m+3 ; 
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*. Multiply both sides by ( ee) 


(ini) #0 : fan) oa) eS Fa 72) tte, 


al Ge Oat) ae Gree oe (Ore 


= 13 (0")2 iu 1 (C?”")2 
Lol ale al’, 
(since the number of terms is odd) 


= 4 {0 + (C")5 
4m m \2 
“Halt 


o(m nfl (Gey. 7 


The series 
m: \e m.m-—-1 \? m.m-1.m-2 \? 
fim) = (579) Ge ea) Gar ence) + ete. 


has the value } m, a discovery I made when starting to table it. It is a particular case of a more 
general series ‘which may be summed as follows: 


Consider the following identity: 


= ee eNO 
(m+m—1 m — 1)? 7 ea ee & m “y. 
4m m+m 


m—m — 2\2 
tiply both sides b ( es 
Multiply both sides by aig ae 


Then 
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The process may be continued, thus 
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Assume now that this operation has been performed g times and that as a result the following 
equation is identically true: 


{m - (m-q+1)P _ m ~ (m - q + 1)\? 
4m = (m - 9) (> } 


+ (mga Pam ma (mad) Vs ete 
m+(m-q) m+(m-q-1) 
Operating as before, we have 
{m +(m-q+1)P _ _— , {m-(m- 9+ IP 
TS —* (m-q+1)+ 7% : 
= —~q+2))2 i # 2 
Multiplying by a, and inserting the value of ae = ne) we have 
m+(m—- qt 
— = 2 Sue ae, pt, = 2 
{m — (m 2A OE = (m -g4 i) aay 
4m m+ (m-q+1) 
+ (m-q) {Pa ae) ei 
m+(m—- gq) m+ (m - 1) 
+ (mga (Paar m —(m ~ q) _m —(m- q+ 2)? 
m+(m-q) m+(m-gq-1) m+(m- 1) 
+ ete. 


Hence if the expansion be true for g + 1 it will be true forqg +2. But it has been shown true 
for 1 and 2 and it is therefore true generally. 


{m= (m~q+1)} 


If now in the expansion for we put g= m — 1, we get 


4m 
m m \2 m m — 1\2 a Pe RON 
qlee | +2(" i 5) +3(—" eee +... =f (m). 
Collecting these results together we may now write the probable error of 7, 
ete m \2 
= 67449 ——_ {2 Se ee, 
Jn’ ok) 2n’J 


Values of the function (2¢ (m) — 1) have been calculated for values of m from m= 1 to m = 20 
and are given in the table below. 


| m 2p (m)-1 m 2p(m)—-1 
1 1-5000 Il 4-22.82 
2 1-94.44 12 4-4100 
3 2:3100 13 4-5845 
4 2-6265 14 4-7527 
5 2-9094 15 4-9151 
6 3-1673 16 50723 
7 34059 17 52248 
8 3°6289 18 53730 
9 3-8390 19 5-5172 
10 4-0383 20 56578 


After m = 20, the approximate value 
2 (m) — 1 = 1570796 (m + °375) 


given by Stirling’s Theorem is close enough for practical purposes, e.g. m = 20, it gives 5:6573 
or 5:6578. 
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On certain Types of Compound Frequency Distributions in which the 
Components can be individually described by Binomial Series. 


By KARL PEARSON, F-.R.S8. 


Certain difficulties with regard to the interpretation of negative binomials, which are of constant 
occurrence in observational frequency series, have suggested the following investigation. 


Consider a number u of binomial series of which the sth is v,(p, + qg,)” and let us suppose 
a frequency series compounded by adding together the rth terms of all these series, such will be 
the compound frequency series it is proposed to discuss. We can realise its nature a little more 
concretely by supposing » balls drawn out of a bag containing Np white and Ngq black balls, 
N being very large as compared with n, or else each ball returned before a fresh draw, while 
the values of p and q change discontinuously at the v, + 1, v, + vp + 1, vy + vg + v3 + I, ete. 
draws. We shall take as origin the point at which the sum of the first terms of all the binomials 
may be supposed to be plotted. S will denote summation to w terms. 


Let N = S(v,), and Nyy’, Np,’ be the moments of the compound system about this origin, 
Np, Nps its moments about its mean. Thus 


Np’ =8 (vsNs), 
Np! = 1S) {Vs (NPsTs a n?qs”)} . 


5 12 
Hence Npg = nS {veds (1 - Gs)} + 12 (Sea?) - Eta) 
, n 9 TQ 9 
a Ney + N [nS {VsYs' (Us— Ia")? } — NS (v,95")], 


since NV = S(y,). Accordingly, if o be the standard deviation and m the mean of the compound 
series, i.e. py = 0”, py’ = m, then 
o 1 1 (MS {vevy (ds — We)?} — NS (1595") , 
i aN ( Swag ). spagouuSnueniadaneee (i) 
Now suppose we had endeavoured to fit a binomial V (P + @)* to the compound series, we 
should have had 


m = kQ, Gg? = IAG}, 
and accordingly have found 


Oe 1 (= (vars (ds - qs')"$ —- NS ete ae 
N Sera) istsieleitiavers e/e\aleieina eeicisiets (ii) 
7 {8 (vss)}* wee 

k= > = Srrslanacicaisbicamncs site onaiaee se 
MS {vgvg (Qe — Ys)” — NS (v5 qs”) uy) 


Thus, had we attempted to fit a binomial to the heterogeneous series, we should have found 
Q negative and P greater than unity provided 


nS {usve (Fs — qs)?} be > NS (¥s9s"), 
a condition which will frequently be found to be satisfied, especially if g, be small and n large. 


In the limit let us take ng, = m, and q, vanishingly small, i.e. suppose the sth binomial to be 
replaced by the Poisson series 


me me 
Oa Se aipg 2 ee ee 
( oh ena \2 ; 


then we have at once 
S {vgvg (ms — my)"} 


Q= NS {v,m,' 
S {vgve (ms — my)?} 
Pee MISKOEGTD Hearnb elaine tees aieaieiee eae aba (iv) 
(S {v,met)? 
A= 


S {v,vg (Mg — Mg)**” 


Thus, if two or more Poisson’s series be combined term by term from the first, then the 
compound will always be a negative binomial. This theorem was first pointed out to me by 
“Student” and suggested by him as a possible explanation of negative binomials occurring in 
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material which theoretically should obey the Law of Small Numbers, e.g. “Student’s” own 
Haemacytometer counts*. Of course the negative binomial may quite conceivably arise from 
other sources} than heterogeneity, but if this be the source of its origin in the material of Bortke- 
witsch, Mortara and McKendrick f, it is certainly most unfortunate that such material should have 
been selected to illustrate Poisson’s limit to the binomial. 


Now we know that the values about the mean of the successive moment coefficients of the 
binomial are 


2 = pq, 
fg = MPT (D— QZ), «= cw eanteecaecncteneseescesecreecerens (v) 
Hy = npg (1 + 3(n — 2) pg). 
Further the mean is at a distance ng from the first term p”. We shall call this distance m. 
Let py’, fe’, Hs’ and p,’ be the moment coefficients round the start of each binomial. Then 
My = 2G fp’ = npg + n°’, 
Bs = npg (p — gq) + 8npq X mgt mq®, ewes ween ere (vi) 
Pa’ = npg (1 + 3(n — 2) pq) + 4npq(p — 9) nq + 6 (mpg) n®g? + n4q?. 
From these equations we deduce 


ba = 29, Pe — py’ = n(n - 1)q°, 

ps — Spy + Quy = (w= Wie —2)igay ire emmeterereetet eee (vii) 

Pa’ ~ Gus’ + lpg’ - Guy’ = n(n — 1) (n — 2) (n - 3) q*. : 
Now let a, = p,’ for the combined series, 


ly = ps — pt,’ for the combined series, 

(3 = ps — 3p’ — 2p,’ for the combined series, 

Gy = py’ — Spy’ + 11,’ — Gy,’ for the combined series. 
Then we have, if A, = »,/N, d. = »/N: 


l=’, +A, 
= =H + Asde, 
w a 1) Mt Aedes (viii) 
n(n — Was 2) Aq? + Aoge"s 


A, 


n(n — 1)(n — 2) (n - 8) 
Multiply each equation by q, and subtract from that below it and we find: 


=" + Asqe*. / 


Cie = 3 
a G = re (Go- %)> 


eit Beer ahs es 
m(n-1) Aedes 0a) eee (ix) 
as ee LA es 2 a 
n(n-—1)(n—2) n(n-—1) Asda" (Yo — %)> 
2 43%) 


Wie = Ilo =D) Gu a8) he al eee 


* Biometrika, Vol. v. p. 356, and see also L. Whitaker’s examination of these data, Vol. x. p. 48. 

{ Pearson, Biometrika, Vol. 1v. p. 209. 

{ For an examination of Bortkewitsch and Mortara’s instances see L. Whitaker, loc. cit. pp. 49-66. 
McKendrick has recently reached Poisson’s series (Proceedings of the London Mathematical Society, 
Vol. xmr. (1914), pp. 405 et seq) without apparently recognising that he was on familiar ground, and 
has suggested its application to the frequencies obtained by counts of the bacilli ingested by leucocytes. 
He has fitted his series not by moments, but from the first two terms, and has failed to recognise that 
a large proportion of such leucocyte counts give also negative binomials. 


Ls, 
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Hence dividing each equation by the preceding one: 


Qe _— 4h a3 — _ % 4% aq 4 A 7 

n(n-1) nm _n(n—-1)(n—2) n(n—1)_ n(n—1)(n—2)(n—3) n(n—1)(n—- 2) _ 
eae a As nM a 
n : n(n-1) n n(n—1)(n- 2) n(n-— 1) 


Writing 9,¢. = P, and q, + dq = P;, we obtain 
a, — (n- l)a,P, + n(n - 1) P,=0, 
Gs (= Nae rer (ne = LN =D) ya — "Oy. Saren ss cease savososehsonnenn (xi) 
a, — (n — 3)a,P, + (n - 2)(n - 3)a,P, = 0, 
three equations to determine n, P, and P,. 
Eliminating - P, and P, we find: 
| dg (n-1)a, n(n - 1) 
Gs (Wet 2) ide s(n EB) (ME 2), tO} ce ccassncsccesseacscosse ons (xii) 
a, (n- 3)a, (n —- 2)(n- 3) a, 
which expanded gives us the cubic for n: 
N3 (2A, AyA3 — Ay? — 42d, + Ay, — G3”) + nN? (— 12a, a.d3 + Ta? + 4a,2a, — 3a,a, + 4a,”) 
+ n(22a,a,a, — 16a,° — 5a,2a, + 2a,a, — 3a3”) + (— 12a, agaz + 12a,% + 2a,?a,) = 0. ...(xii)>Is 


A root of this cubic substituted in the first two equations of (xi) will give P, and P, and then 
the quadratic 
Demee Dict loon BOF winnaiiae atic entantelsciesce te seeccimeisee (xiii) 


will determine the two values q, and q. corresponding to the value of n. The first two equations 
of (viii) then complete the solution by providing ), and dg. 


Until the roots of the cubic (xii)»!s have been discussed we can only assume that three 
solutions are possible. As a matter of fact in the examples so far dealt with some of these solutions 
have usually to be discarded. 


For the special case of Poisson’s limit to the binomial, we make n indefinitely large, q inde- 
finitely small, and ng = m finite. Hence equations (viii) become 


I SEO 
dy = Aym, + AQMy, 


AEN Mice rina Mocr hm sacndonetoors casa ecwescseesescsoa nse (xiv) 


(NMG ae DS 
dy = ym,* + AQm,1, 
leading to @—-90,+- Q=0, 
a3 ~ 4,Q, + 4 Q. = 0, 
Ay — A3Qy + AQ, = 0, 
if Q, =m, + Ms, Os = ialons 
Thus we find: 


Q1 = (43 — @4)/(A2 — a"), 


’ Qe = (A341 — 4y*)/(ay — ay”), 
subject to the condition * 


Of (QP e) pte Oo On 8 Ante Ola oscetecjsecinee cet ans-teisier (xv) 
Hence m, and mz, are roots of 
m? (dy — ay”) — mM (dz — AyAg) + AgQy — Mp2 = OL reeeececccceceeees (xvi) 
* Of course equations of condition hold for the 5th and higher moments in the case of the two 


binomial components. But they are of small service as the probable errors of these high moments are 
usually very considerable. 
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, | , 
Hy — Ms, Ne eels 
- ? 

Mm, — Mz Mz — My, 


Further he = 


which determine ), and ),. 

It is thus quite easy to resolve a series into the sum of two Poisson’s binomial limits provided 
the roots of the above quadratic are real. As illustration I take “Student’s” first count of yeast 
cells on the 400 squares of a haemacytometer. He found: 


No. of yea | 
Soaey ie 1 D> |, 73.8 a eomeleatotall 
Frequency 213 128 37 18 3 1 400 


giving: mean = p,’ = -6825, p. = -8117, pw; = 1-0876. 
From the above values of “‘Student” I determined 
fo’ = 1-2775, Bz = 3-0675. 
Whence a, = *6825, a, = -5950, a3; = :6000, 
and the resulting quadratic is 
-129,194m* — -193,913m + -055,475 = 0. 
The roots are m,= +3847 and m, = 1-1163, leading to A, = 59,295 and d, = -40,705, or, the 
series has for its two components 
vy = 237-18, m,= +3847, 
vo = 162°82, Mz = 1-1163. 


Calculating out the Poisson’s series for these components we have: 


No. of yeast : | 
valle 0 1 2 3 4 5 6 | 7 
= | 
Ist Compt. 161-44 62-11 11-95 1-53 “15 Ol -00 -00 
2nd Compt. 53-32 59-52 32-22 12-36 3-45 17 14 -02 
Round totals 215 122 44 14 4 Sie 
Observed 213 128 37 18 3 1] 
aL 


The test for ‘goodness of fit” for these six groups gives y” = 2°82 and P = -73, or the fit is 
very good. The negative binomial gave P = -52 and a single Poisson’s series only P= -04. But 
the double Poisson’s series places of course one constant more at our disposal than the binomial, 
and we can do still better with a double binomial, as we have four constants and only six 
frequencies, while the double Poisson has three constants to six frequencies. It is clear that 
neither of the above components forming 41% and 59% of the total number of cells, and having 
their means at -3847 and 1-1163 instead of -6825, gives any idea of a dominant constitution in the 
solution sampled. If in this case heterogeneity accounts for the negative binomial, then the 
difference of the components is not slight, and the heterogeneity being gross would indicate some 


considerable failure in technique. 

If we assume that the counts with a haemacytometer ought to follow the Poisson distribution, 
and this seems to be theoretically probable,—then the criterion of the binomial might well be 
adopted to ascertain the possibility of some failure in technique. The actual binomial in 


“Student’s” first case should be (g}g + #28)?" and any binomial with p very small and np = -6825 


would effectively represent the series; we could not anticipate getting n = 273 and p= qh> 
closely from the data, but we might certainly anticipate a positive binomial, if the theory of a 
Poisson distribution be correct. If on the other hand we say in this and many similar cases 
that the negative binomial arises from heterogeneity, then it appears to me that we have saved 
our theory at the expense of our technique. I propose now to test this point further by con- 
sidering the component binomials. If the theory of heterogeneity be correct, unless it be very 
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manifold, we might anticipate two binomial components, v, (p, + 41)” + v2 (Po + Go)", with 
n positive and large, both g, and q, being small, while n,q, and n.q2 would be approximately -3847 
and 1-1163, the frequencies v, and v, being roughly in the ratio of 3 to 2. 
Returning to ““Student’s” data we find p,4’ = 8-9275, whence 
a, = *6825, a, = *5950, a, = °6000, a, = 4800. 
Substituting in (xii)!s we obtain for the cubic: 
— :021,3269n? + -028,2321n2 + -363,3020n + -051,0825 = 0. 
This has three real roots, approximately. 
n’ = 4:89997, n’’ = — 14234, and n’”= —- 3-43390. 


We will consider in succession these cases: (i) n’ = 4:89997. The first two equations of 
(xi) provide 
-5950 — 2-66173P, + 19-10974P, = 0, 
-6000 — 1-72548P, + 7-71894P, = 0, 
leading to P, = -55304, P, = -04590, and the quadratic 
. ¢ — -55304¢ + -04590 = 0, 
whence we deduce the binomial factors 
q, = 4514, p, = 5486, and g,=-1017, p, = -8983. 
The first two equations of (viii) give 
T= yt AD; -6825/4-89997 = -451,355A, + -101,685A,, 
leading to , 


-107,535, Ay = *892,465, 
or, in a population of 400, 


Il 


43-014, vo = 356-986. 
Accordingly the compound series is given by 
43-014 (-5486 + -4514)4-59997 + 356-986 (-8983 + -1017)1-89997, 
with means of the components at 
m, = 22118 and m, = -4983. 
We see that neither the sizes of the component populations nor their means have any relation 
to the previously discussed component Poisson series; further the present series* diverge widely 


from Poisson series, 7 is not large nor q, or g, very small. Calculated to the nearest whole numbers 
we obtain: 


Vy 


No. of 
yeast cells v " : : ‘ : 
1st Compt. 2 ¥) 15 12 4 1 
2nd Compt. 211 117 26 3 0 0 
Combination 213 126 41 15 4 1 


Observed 213 128 37 18 3 1 


which leads to y? = 1-27 and P = -93. 
Thus the fit is excellent, but it does not correspond to the heterogeneity of a double Poisson 
series. 
(ii) 7” = — -14234. Here the first two equations of (xi) provide 
5950 + -77965P, + -16260P, = 0, 
-6000 + 1:27469P, + 1-75727P, = 0, 
and give P, = - -81529, PP, = -24996, 
with q@? + -81529q + -24996 = 0. 


* Tt should be noted that such fractional binomial series tend ultimately to become negative, 
although with negligibly small frequencies. 
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This gives imaginary values of g, and g, and thus the solution can for the present purpose be 
discarded. 


(ili) »’’”’” = — 3-43390. We deduce 
5950 + 3-02614P, + 15-22557P, = 0, 
6000 + 3-23317P, + 16-44372P, = 0, 


giving P, = - 1-21441, P, = :20229 
and g + 1:21441g + -20229 = 0. 
Hence % = — 1-015], p, = 2-015], Go = — °1993, Po = 1-1993. 
The A equations are 
=), + Ag, — -6845/3-4339 = — 1-0151), — -1993A,, 
leading to A, = 000043, Ay = 999957, 
or, vy, = ‘0172, vy = 399-9828. 


Thus the component series is 
0172 (2-0151 — 1-0151)—3-4399 + 399-9828 (1:1993 — -1993)3-4939, 
with means at m, = 3-4858 and m, = -6847. 


The first of these components is negligible, it contains roughly only -02 individuals in 400, 
and the second is sensibly identical with the negative binomial obtained by ‘“‘Student,” i.e. 


400 (1-1893 — -1893) —3-6054, 
with slightly modified constants. It provides: 


No. of 
yeast cells ° ; 2 2 = 


| 
| 
| 
Ree 214 122 | 45 14) 4/24 l 
wceseved 213 128 37 18 3 1 


leading to x? = 3-12 and P = -68, which for all practical purposes is as good as the double Poisson. 


Conclusions. It having been suggested that the appearance of negative binomials as better 
“fits” than Poisson’s series for material that is supposed to follow the law of small numbers is due 
to heterogeneity, formulae have been provided for testing whether this heterogeneity is due to 
a second component. If so this component should be small and the first component should 
substantially agree with the primary Poisson’s series. The smallness of the second component 
would measure the goodness of the technique in haemacytometer or opsonic index counts. 
Applied to ““Student’s” first series of counts of yeast cells we obtain (a) two Poisson’s series 
neither of which dominates the data or approximates to the primary Poisson’s series; (b) two 
positive binomials, neither of which has any approach to a Poisson series or any agreement with 
the components of (a); and lastly (c) two negative binomials, one of which dominates the series, 
and agrees with the primary negative binomial. This investigation as far as it goes suggests 
either that ‘““Student’s” first count is really described homogeneously by a negative binomial, or, 
if it be heterogeneous, then the heterogeneity is manifold, and no weight can be given to the 
results of fitting by the primary Poisson’s series. 


The general numerical discussion by the formulae of this paper of a variety of data assumed 
to follow the “law of small numbers” is in hand and will shortly be published. 


NOTE. 


With this double Part of Biometrika is issued an Appendix containing the 
Records of the late W. F. R. Weldon’s Mice Breeding Experiments. It was 
intended originally to issue these Records as an Appendix to the memoir in which 
the reduced observations are discussed. While the reductions have made con- 
siderable progress there has been unfortunately great delay in their completion 
owing to Assistants to whom they were entrusted being called away for other 
work, and owing to the whole Laboratory Staff for the past twelve months having 
other duties. As the present Records were set up more than two years ago, it is 
convenient to issue them at once, in anticipation of the explanatory memoir, 
which, it is hoped, will not be too long delayed. The determinations of colour 
and extent of pigmentation from the scales are due to F. J. W. upon whom 


also the more laborious part of the preparation for press of these Records has 
fallen. K. P. 


APPENDIX TO BIOMETRIKA. 


W. F. R. WELDON’S MICE BREEDING EXPERIMENTS. 
RECORDS OF MATINGS. 


For an Explanation of the Symbols used and Notes on the Individual Mice, 
see the conclusion of the Records. 
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W. F, R. Wexpon’s Records of Mice Matings (continued). 
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W. F. R. Wetpon’s Records of Mice Matings (continued) 
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EXPLANATION OF RECORDS. 


In the nomenclature adopted by Weldon, 0180 (say) represents the eightieth cross of the 
original races, i.e. Japanese waltzers and ordinary white mice. The resulting offspring of the 
first litter of this cross are a!, a’, a3, etc., of the second litter of the same cross 5}, b?, b°, etc., 
and of the third litter c!, c?, c’, c!, ete. The matings of hybrids are denoted by 1/90 for the 
first generation of hybrids, 2/790 for the second generation and so on. Thus a mouse 6170? 
denotes that it was the second mouse of the second litter resulting from the 17 mating in 
the 6th generation of hybrids. The characters of any mouse are given by a scheme of the 
ge momens tind 4H64 | 02g 75q58.55.1(4e) | 

This signifies that the mouse 4M64c? was a male; 75 denotes that the pigmented areas 
covered °75 of the total skin as determined by our scale of pigmented areas (see Plate I). 
g denotes that the eyes are black in this case, p would have marked that they were pink, the 
ordinary albino colour, 7 and dv that they were red or dark red corresponding to blood and 
very dark blood colour; as a rule mice with such eyes have only incompletely albinotic eyes, 
i.e. there is more or less pigment found on dissection. The next three figures in the present 
case, 58, 55 and 1, mark the skin colour of the mouse on the mouse skin pigment scale of the 
Biometric Laboratory. This scale was formed by matching glass mosaics to the actual colours 
of the mouse skins. This scale contains 61 tints. Tints 1—9 are the “belly whites,” leading 
up to 10—18, the palest blue greys; 19—27 are pale blue greys; 28—33 are yellow; 34—39, 
fawn yellow; 40—41 are chocolate ; 43—45, chinchilla ; 46—54, wild colour of various intensities, 
and 55—61, black of various intensities. 

Of the three tints obtained from this scale the first, 58 in our case, marks the colour at the 
centre of the median plane of the skin, or if this should fall on a leucotic patch in the case 
of the piebald, the tint of the nearest dark patch. The second number gives the colour of the 
skin halfway between the centre of the back and the centre of the belly, 55 in our case; and 
the third number is that of the colour of the belly. Very often the first two numbers are 
the same, but we found in a good many cases that the tint shaded off from centre of the back 
to the belly, and our second number is a rough measure of the lightening of the pigment as 
we proceed from back towards belly. The final symbols given in brackets are Weldon’s 
appreciation of the area of pigmentation given by the numbers 1 to 6 and his appreciation of 
colour as given by the letters a—g, ch. In the case of mice alive at his death and not then 
classed, these colour appreciations and pigment extents were provided as far as possible in 
accordance with his recorded judgments of other mice. 

In the figures on p. 44, reproduced from Biometrika, Vol. u1. p. 3, Fig. 6 represents a normal 
waltzing mouse. 

Fig. 1 is Weldon’s Class 1, more white and less extent of colour patches than a normal waltzer. 

Fig. 2 is Weldon’s Class 2, having about the same amount of colour patches as a normal 
waltzer. 

Fig. 3 is Weldon’s Class 3, having somewhat greater extent of colour patches than Class 2. 

Fig. 4 is Weldon’s Class 4. 

Fig. 5 is Weldon’s Class 5 with only a whitish belly. 

Weldon’s Class 6 is a whole colour mouse with nearly the same colour on the belly as on the 
back. Read with a planimeter Weldon’s Classes are represented very roughly by: 

1 2 3 4 5 6 
14 53 68 77 92 100 

We ourselves found considerable difficulty with “whole coloured” mice, and finally arranged 
them in three classes as follows: 

100x are mice with the skin pigmented all over, i.e. the same colour on belly as on the back; 
100y are those with a line of paler belly pigment, but not sufficient to take the skin out of the 
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100 percentage; 100z have a skin pigmented all over, but a strip of paler belly colour which, 
if white, would make the skin fall in the 80 to 95 percentage classes. This means that our 
1002 and 100y ought to be Weldon’s 6. As a rule our 100z are also his 6, but sometimes 
he denotes them by 5-6, and on some occasions he has called them 5. We have carefully 
considered these cases, but believe that our judgment, where not modified, must stand, as the 
belly pigment in these cases is often almost as intense as the back—e.g. 55 as against 58—and 
even when it is 9 against 19, this is not a pure white (1 only) but a shade of grey. We think 
our 100” for a pigmented belly with lighter hairs is more definite than terming the whole 
belly white, in which case our first number giving the extent of piebaldism would have to be 
modified, but the measurement would be exceedingly difficult as the pigment is there and only 
gets more plentiful towards the back. Asa rule Darbishire and Weldon judged the amount of 
pigment on the living mouse and the nature of the belly pigment was not so open to full 
observation as when determined from the skin. 


Weldon’s colour classification was as follows: 


a=yellow e =black 

b =fawn-yellow f =pale blue grey (“lilac”) 
e =pale wild colour g =chocolate 

d=dark wild colour ch=chinchilla 


Roughly Weldon’s a=our 28—33; his b=our 34—39; his c and d=our 46—-54; his e=our 
55—61; his f=our 10—27; his g=our 40—42; his ch=our 43—45. Naturally there is a good 
deal of personal equation in colour-determination, perhaps less with a colour scale than with 
mere verbal classification. Thus we should a priori have supposed c would range from about 
46 to 49 or 50 and d from 50 to even 55 or 56. But we find c used for as dark a tint as 55 
and d for as light a tint as 48. In the later records Weldon rarely, if ever, uses d, only ¢, 
so that its practical use is in the first or second hybrid generations, as in Mr Darbishire’s 
paper. There d may be as low as 45, a tint we should call chinchilla. Our experience would 
make us doubt whether a rigid line can possibly be drawn between the various colour classes, 
with the possible but not certain exception of the yellows. Even here such judgments as b-c 
and b-f may-occur. The differences of d and e, and f and g, may be so slight that intermediate 
classes may be formed as d-e, f-9. 

A w following the Weldon colour class in the brackets at the end of the description of the 
individual mouse marks that it was a waltzer. Intermediates in colour or in piebald class 
are marked by a hyphen, eg. a—b or 3-4. 

Of the original stocks, the Japanese Waltzers were either bought or bred in the Laboratory. 
They were of pure stock, for they bred true to the parents when crossed only among themselves. 
There were very slight differences of the fawn colour in the body, and all eyes were pink. 
A waltzer is distinguished by the letter W, and as a rule by its laboratory number. The 
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skins of these Waltzers were not preserved, but in the case of most of the animals a rough 
appreciation of the extent of the colour patches was conveyed by the use of the letters a and 
8 and their combination a8. Figure 7 on p. 45 represents an average waltzer =(af), all waltzers 
with appreciably less colour are noted by a, all waltzers with appreciably more colour by f. 

With regard to the albino parents we have distinguished ten classes, which are indicated in 
the following manner, the great difficulty is to be certain of the ancestry of albinos bought from 
dealers. In the Index Bt= Bought, Lb=Laboratory-bred. 


Albino Classes. 


(1) Parents and grandparents all albino, ancestry beyond unknown. : AQ 
(2) Both parents albino, but one or more grandparents known not to be Albee Ax3 
(3) One parent albino, and the other not albino . : , : : Ax2 
(4) Neither parent albino. : : : F : : : : : Axl 
(5) Nothing known of ancestry. : ; 5 Al? 
(6) Said to be pure-bred albino. . : : ; ; Al 
(7) Both parents albino, no definite tones of Poraoaents : : : AQ? 
(8) Back to grandparents all albino, but beyond known to be cross-bred — . ; A2x 
(9) Both parents and two grandparents known to be albino, no definite know- 
ledge of the other two. : Aas 
(10) Back to grandparents all albino, 1 or 2 peandneenr pure bree aibines no 
definite knowledge of others’ ancestry. : ‘ ; ; : Als 


In Weldon’s later breeding work on first crossss, i.e. the high numbered OH and 14, he paid 
much attention to the ancestry of his albinos, and the following pedigrees illustrate the ancestry 
in typical cases. The reader must note that the offspring stand at the top of the pedigrees, and 
we descend to parents and ancestry. 


(1) A48qa? Alb. in 1//148, 2152 
© A43a? (Alb.)= A2 ? 
| 


| 


: a 
| | 
3 from Mottram= Al? ® from Austin= Al ? 
(2) A58a* Alb. in 2H155 
° A58at (Alb.)=A x3 
[ “eee 
| 
3 cross-bred from Steer= A x © from Bacon= A1 ? 
(3) 1H153a' Alb. in 2124 
3 1H153a3 (Alb.) A x 2 
| 
ee | 
3 A25a3 Alb. = A24 9 0H103a' =3e 
| | 
ee ee | <a 
dé Alb. from Féckelmann,  @ Alb. from Steer, 3d Was, No. 16%? 9 Ala’ Alb. = 424 
No. 89=A1 ? No. 88=A1 | 
= 


I aa 


| | 
d Alb. from Bacon, @ Alb. from Steer 
No. 57= 41? No, 56= A1 
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(4) A159a* Alb. in 280 
© A159a4 (Alb.)= Al 
. | 


= eile == 


I aaa a ar | 
| 
g Avia! Alb.=Al 2 A77a° Alb.= Al 
| | 
| a Peay 
Aides Alba 9 Alda Alb,= Alf Ala Alb.=Al 9 Al4a! Al Ag 
| | 
I~ —— [ ] | p+ 


| | | | | | | | 
3 Pure bred 9 Pure bred ¢No.71 QNo.70 GNo.71 QNo.70 gNo.71 GNo. 70 
Alb. from Steer, Alb. from Steer, 
No. 71=Al No. 70=Al1 


(5) A26a> Alb. in 14/146 
6 A26a3 (Alb.)= A2 ? 
| 


| = 
| | 
3 Alb. from Paris= Al ? ? Alb. from Bacon=A1 ? 
(6) A32a8 Alb. in 11156 
6 A32a’ (Alb.)=A2? 
| 
a ge 
| 
3 Alb. from Foéckelmann= A1 ? 9 Alb. from Féckelmann = 41 ? 
(7) A46a? Alb. in 17/149 
3 A46a? (Alb.) = A2? 
| 
ee 
| | 
3 Alb. from Paris= Al ? @ Alb. from Paris=A1 ? 
(8) A48a* Alb. in 1H157 
6 A48a! (Alb.) = A2$ 
| 
r : | 
| 
d Alb. from Broxup, No. 18= 1 ? ° Alb. from Steer, No. 78= Al 
(9) A8la? Alb. in 14158, 1270 
3d A8la? (Alb.) = A24$ 
| 
ic eee it 
| | 
6 Ada? Alb. = A2$ 2? from Fockelmann= 1 ? 
| 
IF =a 


| 
6 Alb. from Steer= A1, 2 Alb. from Broxup=41 ?, 
No. 3 Alb. No. 59 Alb. 
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(10) 9 A1338a! Alb. in 14164; 9 A138a? Alb. in 1H165; 9 A133a! Alb. in 14167; 
g A133a° Alb. in 1H168; ¢ A133a7 Alb. in 11169. 
A133a1~? (Alb.) =A x2 These are } black-bred whites 
| 


ee 


fe = opp a] 
| | 
3 A60a!=6e 9 Adda? Alb.=A x2 
| | 
—— | : =] 


| | | | 
3 Black from Austin 9 Black from Austin 3 Black ® from Féckelmann=A1 ? 


(11) A136a! Alb. in 14171 
g A136a! (Alb.)=4x1 
| 


a1 a 


r rie 7 
| | 
3 A18a? black @ A18a! chocolate 
| 
| | | r 1 
| 
3 Chocolate from Field, @ Alb. from Steer= A1, 3 Alb., No. 79 9 Alb., No. 78 
Alb. No. 79 Alb. No. 78 
(12) Al47@> Alb. in 1/7187 
Q Al47a> (Alb.)=A2x 
ake 
a ee ee 
3d A95a! Alb. = A2x 2 A34d> Alb.=Ax3 
| | 
| | | ee 
| | 
g Al3a! Alb.=dA x 3 9 Alb. from Strong, 3 Alb. from Bacon= 41}, Q Lab. x bred 
| No. 12=A1? No. 57 Virgin Alb. = A x 
or 7 
| } 
¢ Alb. from Steer, 2) Cross-bred Alb. from 
No. 3=Al Broxup, No. 69= 4x 


(13) 4153a? Alb. in 14/183, 11273 
g A153a? (Alb.) = Ald 
| 


! 
I 1 


| | 
g A48a? Alb. = A25 Q A32u' ie = 42? 
| 


ro sel r | 1 
3 Alb. from Broxup, @Alb.fromSteer, 4 Alb. from Féckelmann 9 Alb. from Féckelmann 
No. 18=A1? No. 80= Al =Al? =Al! 


(14) A182a° Alb. in OH164, 11274 
Q A182a5 (Alb.)=A14 
| 


r : 1 
¢ A79a! Alb. = A424 2 A79a? Alb, = A24 
| 
I : | | 1 


| | os | 
¢ Pure bredfrom =? Alb. from Féckelmann 3 Pure bred from 9 Alb. from Féckelmann 
Steer=A1 =A? Steer=A1 =Al1? 
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(15) 0f155a? Alb. ; 0H156a! Alb. 
° 01/155a? (Alb.)=A x 2 
| 


r LE 
3d 1H42b!=3a-bpw @ A58at Alb.=A x3 
| | 
i Gi [ ae 
| : 
3 0OH1a?=3e 9 OM12a°=5a 3 Cross-bred 9 Alb. from 
| | Alb. from Steer Bacon= A1? 
a fo =| =AxlordAx2 No. 15 Alb. 
| | | | 
36 No.l=Wa  @Cross-bred 3 No. 11= Wa @ Alb, from 
Lab. Alb.= A x Strong= 41? 
No. 1 Alb. No. 11 Alb. 


0156a! is the same except that the mother is 458a° instead of at. 


In the following Notes on Individual Mice, we have entered chiefly Weldon’s remarks, 
but occasionally we show reason for differing from the account given of individual mice in 
Darbishire’s memoir. We have only done so where on due consideration it seemed necessary, 
and chiefly because we found a wrong transcription from the records, or a characterisation not 
in accordance with the skins in our possession. From OH to 2H matings we reproduce largely 
the results as given by Darbishire, although Weldon himself increased the number of these 
matings and had preserved for his own work all the skins possible. From 3/7 to 6H the 
matings are taken entirely from the card indices of Weldon, which were completed after his 
death when the stock in existence was killed and the skins preserved. 

The notes we think throw considerable light on the difficulties of a large series of experi- 
ments of this kind; they mark the extent to which the mothers destroyed their offspring ; the 
reasons why gaps occur in the matings, or in the skins preserved*; the occurrence of deformities 
or pathological states; the possibility of the young mating at a very early stage, and the 
difficulty of preventing at least some escapes. We note also how hard at times is colour classifi- 
cation by mere verbal categories. The following terms are abbreviated : 


outbred=OB ; cross-bred= VB; inbred=/B; pure-bred= PB, 


NOTES ON INDIVIDUAL MICE. 


OH Matings. 


0#8 or 7 A wild coloured mouse from Cross 7 got loose but was caught ; it was put back with 
the young of Cross 8. Anyone of the litter called 8a therefore may be 7a.—OH8 or 7a? A huge 
mouse, looks like a rat.—0H8 or 7a® Died young.—0H9a! Mangy.—0H17, 0H19 Young all 
eaten.—0/735a', 0H35a4, 0OH35a° Mangy.—0H45 In Book of Crosses it is distinctly stated under 
Cross 45 that the father was the ¢ from Cross 10. Under Cross 6 the ¢ from Cross 10 is said 
to be No. 6. No. 6 in picture is W8 but under Cross 45 in Biometrika, Vol. 11. the father 
is stated to be Wa.—0/45a3 This mouse waltzed, but not in a very close circle.-—0H53, 0/55 
Young eaten.—_OH61a* Skin not worth keeping.—062 Mother inbred 5 years.—OH72 Offspring 
died young.—0H84a? The colour of coat is almost cream.—OH84a’ Escaped.—0//85a° Mange 
on face.—OH87 Offspring born dead.—OH90 Three offspring eaten.—OH91a1 Died young.— 


* Occasionally if two skins in one mating were identical only one was preserved, but usually all 
were kept, if possible. 
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0199 Offspring born dead.—0//102a® Given away.—0//113a! Killed by relatives.—0H113a% Skin 
not kept.—OH113a7 Killed by his doe.—0H113a8 Escaped.—0H114a! Ill and killed.—0A117a# 
Mange on face.—0H120a8 Unhealthy and very small.—0H124a3 Escaped.—0H128 The two 
3cw offspring were killed and mangled beyond recognition.—0H130, 01131, 0H132 The three 
waltzer fathers are brothers.— 0H138a? Very small.—0H139a* Escaped.—0H142 Mother OB, XB. 
—0H142a> Skin destroyed.—0H143 Mother extracted recessive Albino from Prof. Cuénot.— 
027144 Mother refused to suckle young, so all died; mother 75, PB.—0H145 These 5’s are 
catalogued as 6’s. The colour is present but light on belly—0#147 Several offspring died young ; 
mother /B, YB.—OH150 Several offspring died young.—OH152a*, 0H152a°, 0H152a7 Died 
young.—0H153 Several offspring died young. Two 4c’s are w.—0H156a! The lightest of c’s.— 
0H159 Mother OB, PB.—0H159a1, 0H159a*, 0159a, 0H159a+ Escaped.—0H160 Mother 
OB, PB.—OH161, 0162, 0163 Mothers JB, PB.—0H163a> Died young.—0H165, 0H166 
Mothers OB, PB.—OH166a! Escaped and killed.—0H167 Mother OB, PB.—0H168 Mother 
ate young.—0H169 Father JB, PB.—0H169a! Escaped.—0170 Mother black inbred.—0H171 
Mother black inbred.—0H172 Mother black inbred, several offspring died young.—0H175, 
OH176 Mothers 7 black bred.—OH179, 0H184 Mothers 05, X5L.—O0H185 Mother JB, PB. 


1H Matings. 


11 All 4 young died at birth—1/3a!, 1H3a? Died young.—1/73a', 1H3a> Very pale 
“wild.”—1 H4a! Escaped.—1H5a! Mangy.—1 H6a# Escaped and died.—1H7a3 Very light “wild.” 
—1H9a' Very light c.—1H18a! Died young, also 2 albino offspring of 1118 mating.—1H18a!t No 
head movements.—1H19a3, 128)! Escaped.—1 H29a? Is misprinted e for ¢ in Biometrika and 
so in Catalogue.—1 29a Skin destroyed.—1 34a, 1435a Both young born in the same cage, no 
good therefore and killed.—1 36a! The colour of the coat is intermediate between fawn and ¢e.— 
1#36a° The coat colour is nearer e than that of the last.—1H37a?, 1H37a* The coat colour 
is intermediate between fawn and c.—1H42b! The coat colour is curious. There is a “blaze” 
of ¢ on the face and top of head.—1H48a?, 1H48a* Escaped.—1H43a* Too rotten to skin.-— 
1746 All young killed.—1 49a? Described as chocolate; it is intermediate between chocolate 
and a very pale homogeneous ¢.—1/49b° Mangy.—1 #51a', 1H51la? Very pale c.—1H55, 1H56 
Note that the does in these two crosses are sisters, and that the one in 155 is crossed with an 
outbred pure-bred albino, and that the one in 156 is crossed with an inbred pure-bred 
albino.—1//55a! Colour only very slightly darker than fawn, though in the direction of c.— 
1H56a! Mange.—1 H57a°, 1 H57a° Died young.—1H58a! Colour between fawn and ¢ with, perhaps, 
a soft chocolate tinge.—1H59a!, 1H59a°, 1H59a°, 1H59a7 Back legs paralysed of these albino 
young.—1 59a! The colour is rather a soft c.—1H60a° Mange and a damaged foot.—1H60a% 
Died young.—1H63a! The d is very soft.—1H64a! Skin not worth keeping.—1H64a? Rather 
tending to chocolate.—1H66a? The colour is not quite f: it has a suspicion of yellow in it.— 
1H66a* The yellow is very dull, not unlike that of a normal waltzer, but merging very slightly 
to lilac.—1H66a® The colour is almost completely intermediate between that of a waltzer and 
lilac.—1H67 One g and one ? offspring. Both mangy and killed.—1 468 Note that in 168, 
the only hybrid incest pair, there are 3 waltzers out of 5 young. One ¢ and one @ offspring 
mangy.—1 70a? Waltzer’s colour.—1H70a* General form of waltzer. Mangled beyond recog- 
nition.-—1#71 Albino offspring died young, the rest eaten.—1H72 Young eaten.—1H80a‘ The 
colour is a very pale chocolate.—1H81, 1482, 1H83a Young born together: they are described 
as 81a and were killed.—1//84, 1H85a Born together. They are described as 1/784 and were 
chloroformed. 184 comes to an end. 1485 goes on as 1H85b.—1H88, 1H89 Two or one 
family: born in cage before isolation; the young are described as 1H88a and were killed. One 
9? is left with ¢ as 1488). 1489 comes to an end.—1H90, 1H91 Born together. 190 
continued as 1/4/90b. 191 comes to an end. Young recorded and killed. 
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Note by W. F. R. WELDON : 


The last two crosses had three objects (1st) to see whether whites (produced by crossing 
whites with first crosses) when crossed with first crosses—i.e. lst generation hybrids—give the 
Mendelian numbers in their offspring, ie. 50°/, albinos and 50 °/, hybrids, (2nd) to see whether 
albinos produced in this way behave the same when crossed with waltzers as do ordinary pure- 
bred albinos, and (8rd) whether albinos whose ancestry has been “watered down” in this way 
for many generations will breed true when paired. With regard to the “ watering down” of the 
albino ancestry of an albino, if hybrids at the first generation are paired with albinos, and 
the resulting albinos are paired back with hybrids of the first generation, the following table 
shows what happens : 


The fractions in brackets after Hyb. or Alb, indicate the amount of albino ancestry in the 


animal. 
Hyb. (3) x Alb, ——— 23 =14 
Alb. (3) x Hyb. ——— 3 =1} 
Alb. (8) x Hyd. ——. 2 =1] 
Alb. (39>) x Hyb. ——— 1% =1,5 
Alb. (4h) x Hy. —— #2 =135 
Alb. (33) x Hy. —— $F =19 
Alb. (85) x Hyb. —— 333=ly2s 
Alb: (BAR) KBYDs veacccasscstecststssnes 


From which it will be seen that to find the albino ancestry of an albino in a given generation 
the denominator of the fraction indicating the albino ancestry of the previous generation must 
be doubled to produce that of the desired generation: and to find the numerator of the gene- 
ration in question that of the previous one must be doubled and 1 subtracted. 


But it is evident, that continually breeding back with a first cross is bad: because the 
ancestry of the desired albino can never be reduced below a half. The first step (that taken 
in 1/790 to 91) is all right because the ? in 1H91 is Alb. (}) and the young produced (i.e. 
1H91a)?, ete.) will be alb. (3), ie. an J less albino ancestry }=12°; then the degree in which 
the “ watering down” diminishes is shown in the following series : 


or 16, 8, 4, 2, ig — 

or ancestry of 2 in 1//91, i.e. H6a? ="750 
5 5 191d, ete. = 625 
: z Albino in next generation =*562 
” ” ” ” ” ae ‘531 
” ” ” ” ” = 515 
” ” ” ” ” = 507 
” ” ” ” ” 503 


that is to say starting with Alb. (3) (e.g. H6a* in 191) we can in the 6th generation produce 
Alb. (°503). 


1794 Parents doubtful, at any rate in colour of father and the 4 does H84a were pregnant 
when sexed. Offspring destroyed.—1H95 The parent females 62a etc. were pregnant when 
sexed.—1/795a3 Colour a mixture, between fawn, chocolate and c.—1H95a+ Colour pale chocolate. 
—195a° Killed by relations.—196a? Colour a very pale fawn.—1H97a! Pale fawn, with ttle 
white on belly.—1H98a! Much paler than a*.—1H98a° Extremely like 1H98a4, but with less 
white below.—1H98a° A very blue fawn colour.—1 99a! Dark fawn. The belly lighter but 
hardly any real white.—1H99a? Pale ec. Skin not kept.—1H99a* Died young.—1H100a* Skin 
not kept.—1H100u° Rather dark fawn.—1/100a® Pale fawn.—1/H101a! Colour a pale blue-grey, 
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like a mixture of fawn and lilac.—1101la4 Colour not black.—1//101a° Died young.—1H101a7 
No record except skin.—1H1010® Mixture of dark b and very pale chocolate with a look of 
fawn.—1//102a' Colour exactly intermediate between 6 and c.—1H1020? The palest lilac 
imaginable: if not placed beside an albino, it is taken for an albino.—1/102b? A pale but 
obvious 7,—1H102b' A darker f than 1/10263—1H108a! Colour dark fawn.—1H103a? Colour 
yellow fawn. Skin rotten and not kept.—1H103a* Colour very dark fawn or very pale “ wild.”— 
1H103b° The appearance, behaviour and complaints of waltzers are present in this mouse.— 
1H104a° Died young.—1#105a' Colour a brownish lavender.—1//105b! Escaped.—1H105b2 Died 
young.—1/107a! Very pale “ wild.”—1/108, 1H109a@ All offspring died young.—1109a! Very 
like pure race.—1H109a? Colour greyer than in 1H109a1.—1H109b° Died young.—1#H111@ 
A uniform pale grey almost intermediate between a and e.—1//111a? Colour a grey fawn. Died 
young.—1/111la* Colour paler than a? but the belly less white-——-1H1lla® Died young.— 
1#H112a% Colour called f but it is really too brown for lilac_—l1H112a* Exhibits waltzing 
tendencies.—1//114a! Skin not worth keeping.—-1H115a?, 1H115a* Died young.—1 H115b! Some 
parts of ears are clothed with yellow hair.—1H117b! A sharp patch of white between fore legs, 
another much smaller on belly.—1//1180?, 1411867 Escaped.—1121a! Colour very pale.— 
1H121a* Colour very dark.—1H1216! A very dark d.—1/7121b° A measly specimen. Escaped.— 
1/71220°, 1/7122a7 Skins not kept.—1H124a1, 1H124a4, 1H124a° Skins not kept. Head destroyed 
in 1#124a1.—1 1244 Rather of a dark 6.—1H124b° Died young.—14125a2, 1H125a4, 1H125a5, 
17125e°, 1H125a7 Skins not kept.—1H125b All offspring died.—1H126a3 Died young.— 
1/126? Waltzing and neurotic tendency very well pronounced.—1H1260? Called erroneously 
ein Biometrika, Vol. 11.—1H127b' Darker than 14#127b2,—1H127b° Mange.—1 1280! A dark b.— 
1H129a! Died before being painted.—1H129a? Skin not kept.—1H129b Catalogue, Biometrika, 
Vol. 1. and skins differ wholly from Children of Hybrids record book.—1H1296' Died young, 
Only mentioned in mating book.—1#H130a? Very pale 6.—1H133b!, 1H133b? Died young.— 
113363 Very pale.—1H13304, 1/7133) Very pale, died young.—1H135a', 1H135a4 Died young.— 
177136b° Colour intermediate between 6 and chocolate.—1137a! Colour intermediate between 
chocolate and c.—1H137b? Died young.—1H137)%, 1H138a?, 1H138a3 Died young.—1H1390! 
Called 6e. But by no means a pure black. Several hairs on chest, between front legs, have 
brown pigment; and a silvery appearance given by a very few scattered white (?) hairs on sides, 
and a very few on back. Claws white. Hair on hands and feet white: most caudal hairs 
white. Skin of tail pink with a few dark patches.—1H140a! Died young.—1140a? Died 
Young. Distinctly a 5c. Called 6¢ in Biometrika, Vol. 111. but 5¢ in Mating book.—1H140a® 
Died young.—1 7/1423 A very small admixture of long whitish hairs on sides and belly. Digits 
pink with some white hairs. Tail with patches of pink skin, and on these white hairs. 
Vibrissae black.—1H146b> Killed before sexing.—1H149a° Died young.—1/149b4, 1/1503 
Escaped.—1 1526 Young eaten.—1H152c® Skin colour same as 1H152c’,—1H158a5, 1H154a?, 
171550, 1f1155a° Escaped.—1H156a! The hair of the belly rather a dark brown than black. 
On haunches and sides of belly a very few white hairs. One or two white vibrissae. Nails 
pink. Digits and tail with white hairs.—1H158a! Killed accidentally..—14160a Offspring died 
young.—1//161la® Eaten.—1H162a! Not very healthy, skin destroyed.—1/163a1, 1//163a? Skins 
not kept.—1H164 Mother black bred.—1H164a? c very pale.—1H165, 1H166 Mothers black 
bred.—1H167 Mother again black bred.—1//168, 1169 Fathers black bred.—1H170 Mother 
black bred.—1H170a° Very pale. (We should certainly have called it a pale blue grey verging 
on chocolate. K. P. and F. J. W.).—1#171, 1172 Fathers black bred.—1H172a! Escaped.— 
1H172a° Killed accidentally.—1H173a* Really a very dark chocolate, a very wonderful colour.— 
1H177 All offspring eaten.—1//178a Colour same as 1H178a’.—1H179a*, 1H179a® Coats same 
as 1H179a3.—1H182a® Yellow underneath.--1H187a One stillborn offspring only.—1H196a® 
Skin identical with 1H196a3.—1H20la! Very pale.—1H204a? Skin like 1H204a1.—1204a 
Skin like 14204a3.—1H205a? Skin like 1H205a!.—1H205a° Skin like 1H205a°.—1206a" Skin 
like 1H206a%.—1H206, 1207 Mothers JB, PB.—1H208, 1H209 Fathers 7B, PB.—1H211a? (We 
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consider this mouse a chocolate. K. P. and F. J. W.).—1H217a! Colour almost chocolate.— 
1220a3, 1H220a!, 1H220a° Escaped.—1 H222a° Darker than 14222a°.—1H222a° Same colour 
as 14222a7,—171227a* Not found in Catalogue.—1/237 Both parents from extracted albinos.— 
1H237a* Paler, more f than 14237a3.—1H237a' Little browner than 1H237a°.—1238 Both 
parents from extracted albinos.—1H238a! Skin like 1238a3.—1//239a° Escaped.—1 239, 
17240, 1H241, 1242 Both parents from extracted albinos.—1H242a4, 1H242a° Skins like 
1242a°,—1 243, 1H244 Both parents from extracted albinos.—1//244a' Colour like 1H244a7,— 
1245 Both parents from extracted albinos.—1246 Mother from Cuénot’s extracted ? albino. 
—1/246a* Skin like 1H246a°,—1H248 Both parents from extracted albinos, the mother from 
Cuénot’s extracted 2 albino,—1/248u5, 1H248a7 Colour like 1H248a3,—1H249 Both parents 
from extracted albinos, the father from Cuénot’s extracted ? albino.—1H249a° Colour like 
1#249a°.—1 250 Both parents from extracted albinos, the father from Cuénot’s extracted 
? albino.—1H250a? Skin not kept.—1H252 Both parents from extracted albinos.—1 H252a? 
The waltzing movements of this mouse were extraordinary; it was incapable of locomotion.— 
1253 Both parents are hybrids from the extracted G’G’ 142b!.—1H254 Mother is hybrid 
from the extracted GG’, 1H42b'.—1 255, 1256, 1H257, 1258 Both parents are hybrids from 
the extracted G’G’ 1H42l'!.—1H262a! The b is curious: rather like a very pale g.—1H263 
Both parents from extracted albinos.—1H266 One 5a offspring is a w.—l1H270a! Black gene- 
rally: but a few of the longer hairs on the sides and thighs white or yellow. A little white 
hair on the toes. The nails pink. The tail mottled brown and pink, the hairs on it white: 
A mixture of white hairs round the anus.—1//270a! Escaped.—1/273a? Considerable mixture 
of long brownish and whitish hairs over whole body. All digits and most of dorsum of R. 
pes with white hairs. Some brown (and white ?) vibrissae. Tail hairs mixed.—1 278, 14280 
Fathers are hybrid from the extracted GG’ 142b!.—1H282 Young eaten.—1H292 One 5e 
offspring is w.—1H297 One 5e offspring is w.—1H301 Two of the 2c offspring are w.—1H308 
One 2c offspring is w.—1H310 One 5e offspring is w.—1H312 One 2e¢ offspring is w.—1313a? 
No record except skin. 6e taken from skin.—1H324a! No record except skin. 2e taken from 
skin.— 1/7329 One 5c offspring and one 50 are w. 


2H Matings. 


2H12a? Darker than a!.—2H12a? The colour simulates pale g.—2H22a? Skin like a2.— 
2H22a° Skin like a.—2H27a! (Skin we think must be described as an @ like the Catalogue 
and not 6, ¢ like Biometrika. K.P. and F. J. W.).—2H28a° Colour like a?.—2H32a3, 2H32a4, 
2H32a° Skins like a?.—2H33a° Hardly any hair.—2 38a! Colour intermediate between chocolate 
and c.—2H38a? The 6b is dark tending to chocolate.—2H38a'!, 2H38a5 The 6 is not very 
yellow.—2H40a! Homogeneous grey tending to chocolate.—-2H40a® Tending to dark 6.— 
2H45a?, 2H45a Colour like a.—2H49a! Much blacker than any hybrid in present stock. No 
white or brown hairs on back: a very few scattered brown and white hairs on sides ; more on 
belly : and grey ring round perineum. ‘Tail hairs white. Dorsum of digits white. Claws pink. 
Vibrissae all black.—2 49a? A distinctly greater mixture of yellow or brown hairs along sides of 
body than in 2H49a', and a group of very few white hairs in middle of belly. Dorsum of feet 
and hands with much pink skin and many white hairs. About 4 the tail pink: the tail hairs 
white. Brown hairs round base of ears, as is usual in “black” mice. Vibrissae black.— 
2H79a% % 4+ One of these 4c’s given as 5¢ in Biometrika.—2H79a2 Colour like a!.—2H79a3, 
2H79a* Skins like a2,—2H79a7 Skin like a!.—282a* Skin like a?.—2H92a! or a? One of these 
3c’s is a waltzer.—2//93b* Supplied from skins as no records are given.—2H96a! Fairly black. 
A few (very few) yellowish hairs on sides and haunches. Perineal patch grey ; a good many wild- 
coloured hairs mixed with the black on belly, and a very small spot of a few nearly white hairs 
about umbilicus. Digits pink, with white hairs. Whiskers black. Hairs of tail some black (?), 
some white.—2H101la?, a3, at or a® One of these 6c’s is a waltzer.—2H102a! Very good pure 
black dorsally: perineum, sides, and upper part of limbs with mixture of wild-coloured hairs. 
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A few white hairs on belly. Brown hairs round bases of ears well marked. Fingers and toes 
pink, with white hairs. Tail with dark skin, the hairs wild colour and white. Vibrissae black.— 
2H104a' or a One of these is a waltzer.—2108a! A very good black, with very few white and 
brown hairs on sides and belly. The perineal patch wild-coloured. Hardly any brown hairs at 
base of ears. Digits pink, with white hairs. Tail skin pink and dark. Hairs of tail some 
white, some brown or black. Vibrissae black. —2H108a? Colour almost like 2/108a!, but with 
more brown hairs on belly.—2H109 One 4e offspring is a waltzer.—2H114a! A very good deep 
black, with a very few whitish hairs dorsally and few scattered brownish hairs on sides, and 
more, but not many, on belly. Hardly any brown hair round base of ears. Perineal patch grey. 
Digits white. Skin of tail pigmented, but many hairs white. Two or three white vibrissae on 
L. side, three on R. side, the rest black.—2H117a! A brownish black above, with considerable 
mixture of whitish or white hairs on limbs and a few on sides of trunk. Belly and perineal 
patch grey from mixture of white and brown hairs. Digits and dorsum of hands and feet with 
much white hair. Tail brown, with much white hair. Hair at base of ears pale brown. Some 
white vibrissae.—2/124a! A very black mouse, with very small admixture of brownish hairs on 
sides and belly. Perineal patch very small, brown. Hardly any brown hairs round base of ears. 
Toes and fingers pink with white hairs. Vibrissae black.—2 143 One 6e offspring is a waltzer.— 
2H147a! or a One of these has pink eyes.—2H 151a1, a? or a4 One of these has pink eyes.—27155a1 
Really very little white. A small patch of pure white on belly: the fingers and toes pink with 
white hairs. The skin of the tail nearly all dark, but with many white hairs. The rest a very 
good black with the long hairs darker than usual. Pale brown hair round base of ears. Vibrissae 
black.—2H155a?2 A good black with very few lighter hairs, Fingers and toes pink with white 
hairs: the tail skin pink for about } its length, with white hairs. Brown hairs round base of 
ears. Whiskers black.—2H155a? A fair black above, and at sides, with a very few white hairs. 
Perineal hairs mixed with white. Considerable mixture of white hairs on belly. Hairs round 
base of ears pale brown. Digits pink, and much white hair on hands and feet. Most of tail 
pink, the hairs white. Vibrissae black.—2H156a? A poor black, with many long, whitish hairs 
at sides. Perineal patch considerable, yellowish white. Belly with not much more white than 
sides. Much white hair on fore-limbs, and on hands and feet. Digits pink. Hair round base 
of ears very pale. Some white vibrissae.—2H156a* Almost exactly like 2H156a*, but the 
whitish hairs on sides and belly a little more plentiful—2H156a* Almost like 2H156a?, but 
without any white whiskers, and with the hair round the base of the ears still paler.—2300a? 
A very good black; extremely few brown or white hairs scattered on belly and perineal area. 
Skin of tail dark above, also below except near base. Vibrissae black. Most hairs on ears black, 
the rest deep brown. Claws pink. Dorsum of manus and pes black ; white hairs on phalanges IIT, 
II, and parts of I in each digit.—2H300b° Very good black. Very little brownish hair on belly 
and perineum. Ears and hair behind them brown. Phalanges of feet pink, with white hairs ; 
those of hand darker, with few or no white hairs. Tail skin dark, with some white hairs. 
Whiskers black.-—2300c? A good black: a little scattered (white? or) yellowish hair on sides 
and belly. Hands and feet pink. Tail pigmented with pale hairs. Many pale whiskers. 
Perineal patch small. Ears brown.—2H300c! A rustier black than 2H300c?; the hands with 
more white hair: a little unpigmented skin round base of tail.— 2 H300c° Generally like 2H300c°, 
but the pale hairs whiter, and the distal { of the tail unpigmented.—2H3801a" Much scattered 
pale brown (b or ¢?) hair on sides: belly with distinct brown tinge. Ears and hair behind 
brown. Tail dark with brown hairs. Distal phalanges of foot pink, with white hairs; those of 
manus pink with few or no white but some brown hairs.—2H 301d? A very uniform brown-black, 
darker than d.—2H301a* A yellowish lilac.—2H301a> Little scattered pale (a and white) hair. 
Perineum pale a. Ears pale brown ; hair behind them nearly white. Nostrils pink. All digits 
more or less pink, with pale or white hairs. Tail, skin darker above, paler below, with white 
hairs. Many vibrissae white.—2301a° General colour black, with a little scattered white hair ; 
but about 3 very small white spots on R. haunch. Whiskers black. Otherwise like 2H301a5,— 
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2H302b? A good black: the scattered hairs distinctly yellow. Digits pale with yellowish hairs. 
Perineal patch yellowish. Tail pigmented with yellowish hairs. Many pale whiskers.—2H30263 
Like 30202 but with less coloured hair. The toes and tail darker.—2H3026! Small white blaze 
on forehead : white snout: white patch on belly : grizzly line on haunches.—2 302° An extremely 
good black: even the digits with much pigment. Ears still brown.—-2302b° Body nearly or 
quite as black as 302b°, but the digits pink.—2302ct Fair black. Long hairs on sides, and 
quarters yellowish white. Digits pink. Ears brown. Whiskers black. Tail pigmented with 
whitish hairs.—27302c* Fair black. Long hairs on sides yellower than in 302c%, Digits pinker. 
Perineal patch larger and yellowish white. Some pale whiskers. Tail pigmented with whitish 
hairs.—2H302c® Rather more scattered pale hairs than 2/7302c. One or two isolated pale hairs 
on forehead. The digits pinker than in 302c; the whiskers conspicuously mixed black and 
pale.—2303a! A good black above: scattered pale hairs at sides and below: perineum pale 
grey. Ears brown: hair behind them pale. Most digits pink with white hairs. Skin of tail 
dark with white hairs. Some white vibrissae.—2H303a? Fair black. Feet, hands and tail pink, 
with white hairs. A white patch on one side of snout. All (?) whiskers white. A white patch 
across belly, running as narrow band up each side.—2/7/303a? A large white belly patch, with 
narrow white band obliquely across back. Hands, feet and tail pink.—2H303a! Body a good 
black, with very few pale hairs on belly and perineum. Tail dark, with some pale hairs below. 
Digits pink: those of R. pes with white hairs only: those of L. with fewer: those of manus with 
very few or none. Ears brown, the hair behind them brown. Whiskers black.—2 30363 Minute 
white patch on forehead : large white belly patch; faint white bar across haunches.—2H303b! 
Small white belly patch.—2303b° A fair black. Hands and feet very pink, with white hair. 
The scattered lateral hairs more yellowish.—2 304 One black eyed offspring born dead, a mating 
previous to a.—2H30403 A fair black. Scattered yellowish hairs on sides. Perineal patch 
yellowish. Tail pigmented with pale hairs. Digits of manus slightly pigmented, of pes pink. 
Hair on and behind ears brown. Whiskers black.—2H304a% Good black. Few scattered 
yellowish hairs. Perineal patch yellowish. Digits pink. A few pale whiskers. Tail feebly 
pigmented with pale or nearly white hairs.—2/7304? Tail coloured.—2H304b? Tail ringed pink. 
—2H305a1, 2H305a%, 2H305a4, 2H305a5 A yellowish lilac.—2H307a! Killed by ?.—2H308a! 
A fairly typical fawn, the belly white.—2H808a? A pale and rather “blue” fawn. Belly 
coloured except for a small angular patch of pure white.—2/308a' A rather dark fawn. Belly 
white. —2308a! A rather “blue” fawn: belly coloured with sharply defined white patch.-- 
2H308a° Paler than a4. The white of the belly comes on to the sides at the waist.—2308a° 
Almost exactly intermediate between fawn and lilac. Belly white.—2H309a! A typical fawn. 
Skin not kept.—2H309a? A typical fawn.—2H309a, 21309a! Rather grey fawn.-—2H309c!, c?, 
c, ct Died before sexing.—21309e°, ®, c’, c® Died and half eaten.—2 7312a! Some scattered pale 
hairs at sides : belly with many pale hairs: perineum pale. Ears and hair behind them brown. 
Tail skin dark, with white hair, especially at tip. Digits pink with white hair. Some white 
vibrissae.—2/7312a° A fair black: tail pinkish with white hairs, and a narrow linear streak of 
white on belly. Digits pink with white hairs.—2H312b! Escaped.—2H312)3 Fairly uniform, 
but very rusty black. Perineum grey. Tail dark with white hairs. Hands and feet pink with 
white hairs. Ears and hair behind them brown.—2H312b° A good black. Ears dark brown, 
hair behind them lighter. Very little paleness on belly and perineum. Digits pink, with some 
white hairs.-—-2/7317 Data marked * supplied from skins. 


3H Matings. 
31a’ A dark chocolate.—3H1a3 A yellowish lilac.—3/Hla* A yellowish lilac: colour very 
uniform.—3H1qa? A yellowish lilac : indistinguishable from 3H1a!—3H1a’ Some scattered light 
hairs on belly. Hairs behind ears nearly white. White hairs on digits. Whiskers black. Tail 
with dark skin.—3H10° A rusty black, with scattered yellow (a?) hairs. Belly and perineum with 
so much yellow hair that the mouse can hardly be put in Class 6. Ears brown: hair behind them 
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very pale. Feet pink: terminal phalanges of hand with white or pale yellow hairs. Tail skin 
dark, hairs pale. Whiskers black.—3H1b! Body a good black above (darker than in 3H10%), 
belly and perineum with much yellowish scattered hair. Ears and hair behind them brown. 
All digits with white or pale hairs. Tail pigmented except at tip, with pale hairs. A few pale 
whiskers.—3H1b7 Almost exactly like 3H1b°. Escaped.—3H2a? A rusty black: a narrow pure 
white patch on belly; much scattered pale hair among black, at sides and below. Perineal patch 
with much white hair. Hair on and behind ears fawn coloured. Digits and dorsum of hands and 
feet with white hairs. Tail with patches of pink skin and some white hairs. Some pale 
whiskers.—3H2a* No definite white patch but much scattered pale (fawn?) hair on sides and 
belly. Hands and feet with much pink skin and white (?) hair. Tazd almost all dark skinned, 
with pale hair. Hair on ears pale fawn: behind ears nearly white. Vibrissae black.—3H2a™ 
Much scattered white hair. Much pink skin and white hair on hands, feet and tail. Hair 
behind ears whiter than in a4, Whiskers black.—3H2b4 Much fawn hair scattered on sides, 
more on belly. Perineum yellowish white. Tail skin feebly pigmented: hair light. Digits 
pink, almost all hair on them white or pale. Snout and chin with pale hairs. Hair on inside of 
ears and behind them pale fawn: on outside dark or black. A few fawn whiskers.—3H2b° A 
good uniform lilac, but very pale (nearly white) at root of tail— 33a? Poor black. Much 
scattered pale hair on sides: a very small spot (3—4 sq. mm.) pure white on belly. Digits pink, 
with white hairs; some white hairs on dorsum of hands and feet. Skin of tail dark near base, 
the rest pink. Caudal hairs all pale. Hair behind ears nearly or quite white : hair inside ears 
fawn. Some pale vibrissae.—3//3a! Black: much scattered pale (white?) hair on sides and 
belly. Perineum nearly white. Only last 2 phalanges of digits pink. Hair behind ears dark ; 
within ears very pale. Some pale whiskers. Tail dark skinned with pale hair.—33a* Black, 
with little mixture of pale hairs. Digits pink, with white hairs. A pink portion, with white 
hair, near extremity of tail. Some pale whiskers.—3H8a’ Very pale “pearl grey” rather than 
“Jilac.”—3 363 More white than 3b2—3/3b! A good black above: a fair quantity of scattered 
fawn hair on belly, on fore-limbs and throat. Perineal patch pale fawn. Tail skin pigmented, 
hair fawn. Digits pink, hair white or fawn. Hair within and behind ears pale. Many pale 
whiskers.—3H3)° A very good black. Few scattered fawn hairs on sides, belly and throat. Tail 
pigmented except at tip: its hairs white or pale. Digits pink, hairs white or pale. Perineum 
with very little pale hair. Inside and behind ears, hair fawn: darker outside. Whiskers black.— 
3H3b° A good black above: dark slaty grey, with small median white patch, below. Digits and 
whole R. foot, pink with white hairs. Tail nearly all pink. Many pale whiskers.—3H30" Like 
30°, but with less pale hair, and with no pink on tail.—3/3b° A very good black: very little pale 
hair on belly and throat. Digits pink, with some pale hair. Tail pigmented. Whiskers black.— 
3H3c3 Died before sexing.—3H4a* Good black. Very few scattered pale hairs. Terminal 
phalanges pink, with white hairs. Perineal patch very dark. Tail pale-skinned, but not pink, 
with light hairs. Hair within and behind ears pale. Vibrissae black.—3/4a5 Very like 4a, but 
feet whiter, and tip of tail pink.—3H4b! A very small patch pure white in middle of belly. 
Digits pink. Terminal 12—15 mm. of tail pink.—3//4b? A good black, with scattered pale hairs 
ventrally. Perineal patch small. Only phalanges II and III pink, except on R. foot, where they 
are all pink. Tail slightly pigmented, with small pink patches ; hair fawn (?). Hair on and 
behind ears brown. Many pale whiskers.—3//4b? Almost exactly like 4b, but both hind feet 
have pink digits.—3//4c? Small white belly streak.—3H4c’ Dwarf.—3H5a! A very dark choco- 
late: eyes quite black.—3H5a? Good black. Some scattered (fawn ?) hair on sides and belly. 
Very little pale hair behind ears: hair within ears brown, Digits pink, with white hair. Tail 
feebly pigmented, with about 1 cm. at end pink: the hair fawn (?). Some pale whiskers.— 
3H5a3 More pink on toes than a’, and 4 of tail pink—3H5a* Less pink on toes than a’, and 
none on tail.—8H5b3 A good black: some scattered (fawn ?) hair below : perineal patch small. 
Terminal cm. of tail pink. Hair behind ears very pale. Digits feebly pigmented.—3 #5b4 Much 
as 5b3, but digits pink, and tail ringed with pink—35b" A good black, like 56°, but with the 
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digits pink.—3H6a? A good black: very little scattered light hair: a small white spot on 
forehead. Belly with much scattered white hair and a pure white patch. Hair in and behind 
ears brown. Digits and all L. hand pink with white hairs. More than ¥ the tail pink skinned, 
with white hairs. Whiskers mostly white-—3H6a? Rusty black: scattered white hairs on 
sides: perineal patch very pale: belly almost slate-coloured, with linear central streak of white. 
Tail feebly pigmented, the hair white. Hair in ears brown. Digits white. Some white 
whiskers.—-3H6a! Back and sides fairly good black. Snout and whiskers white, with white 
streak on forehead. Hair behind ears nearly white, within ears pale yellow. Digits and parts 
of dorsum of manus and pes pink, with white hairs. Tail pink, with patches of pale pigment, 
the hair white. Belly grey (ie. black with much scattered white hair), with large central white 
patch.—3H6b! Fair black above : much scattered white hair at sides and below. Tail pigmented, 
hair white. Digits pink with white hairs. Perineal patch whitish. Hair on ears yellow, nearly 
white behind ears. Whiskers black._-3H6b? A very slaty black: much dittuse white hair, and 
about a dozen conspicuous white hairs on forehead. Hands and feet with much white hair. 
Many white whiskers.—3 60° Very like 3H6b?.—3H6b' Very like 3H6b*.—3H6b° Median white 
belly patch.—_3 H6c! Mange.—3H7a! Very good black dorsally : perineum yellowish, belly with 
fair mixture of light hairs. Tail dark, hair yellowish except at the extreme tip, where it is pure 
white. Hair behind ears black, inside ears yellow. Skin of ears darker than usual. Digits dark, 
with extremely little light hair.—3M7a*? Much light hair on sides and belly. Perineal patch 
almost white. Hair behind ears almost white: within ears pale yellow. Much white on digits 
and on dorsum of L. foot. Tail with patches of dark and of pink skin, early all the hair white. 
Whiskers black.—3H7a! Almost identical with 7a?, but with less white on the ears.—3H7a® 
Almost like 7a’, but with more pink at tip of tail.—3H7b! A rusty black dorsally, the belly dark 
slate grey. All digits and dorsum of hands and feet pink, with much pale hair. Tail feebly 
pigmented, the hair pale. Hair behind ears pale, inside ears yellow. Many pale whiskers. 
Perineal patch yellow-white.—3H7b? Body a good black; slightly grey below, from a little 
scattered. pale hair. Perineum yellow-white. Feet and the digits of the hands white. Tail 
feebly pigmented, the hairs white. Very few pale hairs behind ears. A few pale whiskers.— 
3H7b® A rusty black above: belly grey, with faint linear white streak.—3H8a* A very rusty 
black. Hind feet and digits of hands white ; 2 of tail pink, with white hair on all the tail. 
Belly with much diffuse white hair, and a small white patch. Pale patch behind ear well marked. 
Hair within ears yellow. Some light whiskers.—3H9a? Generally a good black, with scattered 
whitish hairs at sides, and a line of thickly grouped whitish hairs, making a very pale band, along 
middle of belly. Perineal patch pale yellowish white. Pale tufts behind ears well marked : 
hair inside ears yellow: some white whiskers. Skin of tail dark except at tip, where it is 
bright pink : the tail hair white, Fingers white. The R. toes and part of dorsum of R. foot 
white; only terminal phalanges of L. foot white-—3H9«a° No white patch, but much scattered 
pale hair on sides and shoulders. The pale hair yellowish. Perineal patch yellow. Digits 
white. Extreme tip of tail white. Pale patch behind ear less conspicuous than in a2. Some 
whiskers pale.—3H10c? Eye-colour not noted (but not pink).—3H11a! Poor black, with much pale 
or white hair on sides, fore-limbs, and inside of thighs: rather less on belly. Digits white. 
Tail feebly pigmented below, darker above. Pale patch behind ears very small: hair inside 
ears yellow. Nearly all whiskers pale-—3H11la? Nearly completely white below (belly darkened 
with age).—3H1la* Poor black, but with little scattered pale hair. Perineal patch very pale. 
Hair behind ears nearly white: hair within ears yellow. Terminal phalanges of digits white. 
Tail with pigmented skin but with white hair. Some whiskers white.—3H11a‘ Almost like a‘, 
but with more white on toes.--3H11a® A pale ring on tail. A patch of densely clustered white 
hairs on belly. Otherwise like a.—3#11a’ Exactly likea®. Later ; developed much brown hair 
on shoulders.—3 126 Nine offspring born, seven eaten.—3H14a! A “chinchilla” mixture of black 
and white.—-314a5 “Chinchilla” grey like a1—3H14b! The black with a good many scattered 
white hairs. Cf. the chinchilla of 14a.—3H14b? The black with scattered white hairs.—3//14c! 
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“Grey ”=‘‘ chinchilla” mixture of dark grey or black hairs with white ones.—3H25a? Much 
white on head: fawn patch round each ear: fawn patch on L. side, and large patch on hind 
quarters.—3H25a> Head and much of shoulders fawn: white blaze on face. Large fawn 
patch on hind quarters: no lateral patch.—3H25a! Head fawn with white streak on forehead. 
Fawn continued backwards on left side as large patch. A large fawn patch on hind quarters. 
Much white on right side.—3H25a>° Head and shoulders fawn: muzzle white: hind quarters 
fawn: large blotches of fawn on L. side and over back to R. side. Much more colour than 
the others.—325a° Head and shoulder patch small. Snout white. Hind quarter patch small. 
—3H25a' Snout white. Head and shoulder patch larger. A patch on R. side. Hind quarter 
patch small.—3H25a8 Head with practically no white. Patch on hind quarters divided.— 
3H26a! Head patch covers both eyes.—3H26a? Head patch very small: area round L. eye 
white.—3H29a° Escaped.—3H31 Z 139 albino mother pure-bred for 3 generations.—3H31la! A 
fairly uniform, very dark black: the tail with some pink: the belly slightly paler than the back. 
Digits white haired. Whiskers black. Patch behind ears not very light.—3H31a? Perfect wild- 
colour, grey beneath.—3H31a* Perfect wild-colour, grey beneath, but the tail ringed with pink.— 
331a® With some pink on tail—3H31b' With much pink on tail.—3. #3168 With a little pink 
on tail.—3H33 The Z mother pure-bred for 3 generations._-3433a! Fair black, with few 
scattered light hairs. Belly with more light hair. Perineum nearly white. Tail very slightly 
pigmented with white hair. Digits pink, with white hair. Some white hair on dorsum of pes. 
Ears pale. Hair behind them very pale. Some white whiskers. Tip of muzzle with some white 
hairs.—3H33a? A very fair black: belly darker than in a: perineum yellowish. Some white 
on digits, but less than in a@!. Many white whiskers. Ears as in a!.—3H33a, 3433a* Almost 
identical with a2.—3H33a> Very good black. Perineum nearly white. Digits pink with white 
hair. Tail pigmented, with little pink patch near extremity. Ears darker than in males: a few 
white whiskers.—3H33a° Very like a> but no pink on tail and no (?) pale whiskers. Perineum 
and hair on tail yellowish.—3H37@ Killed by brother.—3H39b? A dwarf.—3H40 Ancestry 
black.—340a? Killed by brother.—3H42 Ancestry fawn. —3H42a! Disappeared and record 
lost.—3H50a? Dark lilac.—3H51b? Minute white ventral spot.—3H53a1 Mangy.—3H53c! Belly 
pale but not white.—359a1 Very dark chocolate. Eyes black.—3H59b! Ears malformed.— 
3H59b? Very small white ventral streak.—3H61la?, 3H61a° Small white belly patch.—3H64a? 
Very small white ventral patch.—3H64a* Very pale lilac.—3H65b! Small ventral white streak.— 
3H65b2 Small oblique ventral streak.—365b° Very small linear ventral streak.—3H65c3 Fairly 
large ventral patch.—3H65c! Small ventral patch.—3H65c° Very small ventral patch.—3 660°, 
3H66b1, 3H66b5 Very dark lilac.—3H66b°, 3H66b' Very brown lilac.—38H7la!, 3H71a?, 
3H71a3, 3H71a* Dwarfs. 


4H Matings. 


4H 1a! Extremely good black above, dark slate colour below. Longer hairs at sides, and 
perineal patch, yellowish. Tail pigmented, hairs yellowish. Digits pink, with pale hairs. Hair 
on and behind ears brown. Whiskers black.—4H 1a? Like 4H1a!, but the long hairs more 
nearly white and some whiskers white.—4H1a* Like 41a!, but more pink on feet.—4H2a? Good 
black above. Scattered hairs on sides and belly fawn. Digits pink.—4H2a° Like 4H2a*, but 
with less scattered fawn hair.—4H2c3 Tip of tail pink.—4H30! Fair black: little scattered very 
pale (?white) hair at sides. Perineal patch nearly white. Hair behind ears very pale yellow. 
Digits pink, hairs white. Tail pigmented, with white hairs. Many white whiskers.—4H30? 
Better black than 3b!; no white whiskers.—4H3b3 Good black. Scattered hairs yellower 
than in the other two: tail pink at end: toes much pinker than in 3b! or 3b2—4H3b° A very 
good black, with few scattered pale hairs. Some whiskers white. Digits pink with white hair. 
Tail pink except at base and tip.—4H3° Like 43b°, but tail pigmented.—-4H4a? Good black 
above. Tail pigmented. Digits pink, with white hairs. Some white whiskers. Ear hairs 
brown. Slaty grey below, with very small linear streak of nearly pure white (say 5 mm. x mm.) 
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on belly.—4H#4a3 Very small white blaze on forehead: white patch on belly.—4H4a* Minute 
white streak on forehead: larger but still very small white streak on belly. Most of the 
whiskers white. Digits pink with white hairs. Tail mottled pink and white-—4H4b! Narrow 
white belly streak. Tail pink.—4H46? White speck on forehead. Trace of white ventral streak. 
Tail pink.—4H6a? Blaze on forehead: large white patch on belly.—4H6b! Small white blaze on 
forehead, long patch on belly.—4H6b? No blaze on forehead: white patch on belly.—4H6b* Like 
4H6b?, but a smaller mouse.—4H6b* White blaze on forehead: white belly patch.—4H6b° No 
blaze. White belly patch.—4H6c? Streak on belly.—4H6c* Blaze on forehead; streak on belly. 
—4H7a Small white ventral patch, unusually far back.—4H7a? Very faint streak of scattered 
white hairs on belly.—4H8 Ancestry black.—4H12a1, 4H 12a, 4H12a° Dwarfs.—4H 12a! No 
record.—4H12a5, 4H12a® Dwarfs.—4H12a7 Omitted in Index, referred to in Mating Book, no 
record.—4H13b! Small blaze on forehead and very thin white band across haunches.— 
4H1306 Very small white blaze on forehead.—4H14a!, 4H 1407, 4H 140%, 4H 14a', 4H14a°, 
4H 140° Belly uniformly white.—4H15a? Dark lilac.— 4H 150%, 4H15a* Very pale lilac.— 
4H15b3 Very pale lilac—pearl grey. Killed (tumour).—4H 16? White face.—4H17a> Bred with 
one brother before sexing as 5H1.—420b3 Small linear white streak on belly.—4H20b' Belly 
white.—4H23a1, 4H23a? Eaten._-4H23b? Small white belly streak.—4H23c!, 4H23c? Very little 
white on belly.—4H24a! A very good black above: scattered white hairs below. Tail pig- 
mented. Fingers slightly pigmented, toes pink. Whiskers black. Perineal patch small. Faint 
trace of white line below appeared later.—4H24a? Small white line on back part of belly. Tail 
ringed pink and dark.—4H24a? Good black above: scattered greyish hairs sides and below. 
Digits pink. Whiskers black. Tail pigmented.—4H24b!, 4H24b?, 4H24)4, 4H24b°, 4H24c!, 
4H24c? Dwarfs.—4H25x This the only young of the first brood. Eaten. Next brood called 
4H25a.—4H25a! A good black. Very little scattered pale hair. Digits largely pigmented. 
Tail pigmented. Whiskers black.—4H25a? A good black. Very like 4H25a! but digits pinker. 
—4H26a? Much scattered hair (nearly white) at sides. Some white whiskers. Fingers and all 
feet pink. Tail pigmented.—4H26a° Face and most of belly pure white.—4H26a° Few scattered 
white hairs close together on belly.—427 Ancestry black. Father uncertain.—4H27c! Dwarf. 
—4H28 Ancestry black. Father uncertain.—4//28a1, 4H28a? Linear white belly streak.— 
4H29 Father’s ancestry fawn, mother’s ancestry black.—4H30a’, 4H30a’ Escaped while brood 
was being sexed.—433a? Much scattered pale hair ventrally. Tail partly pink.—4H36a! 
Mangy.—4H38, 4H39, 4H40 Ancestry black.—4H40a1 Very pale. Face nearly white.—4H41, 
4H45 Ancestry black.—445b! Very light f—4H45b?, 4H45b° Dark /.—4H46b! The palest 
“‘goloured” mouse yet seen.—4H49c? Killed by brothers.—451a* Very small ventral streak.— 
4H51a*® Very small linear white ventral streak.—4H51b* Perhaps a trace of ventral white.— 
4H52a3 Very small ventral streak.—458b? Nearest intermediate between lilac and chocolate 
yet seen.—4H63a' Albino with traces of colour on shoulders.—4H66a! Trace of white dorsal 
streak.—4H66a? Merest trace of ventral white streak.—4H66a° Trace of blaze on forehead.— 
4H66b? Small white ventral patch—4H69 Fawn ancestry.—4H69b! Small white patch.— 
4H69b? Belly white—4H71 Ancestry black.—4H75a? Record lost.—4H846° A few white hairs 
on belly. 


5H Matings. 


5H1a* Escaped.—5H3x Eaten soon after birth.—5 8a! Very small white ventral streak.— 
5H8a* Very pale lila.—5H8a! Small forehead blaze.—5H8b° Small white ventral patch.— 
5H8c! Dark lilac.—5H8c? Pale lilac.—5 H8c3, 5H8c! Dark lilac.—5H10a3 Very small ventral 
streak and forehead blaze.—-5H10a> Very small forehead blaze. No ventral streak.—5H11a5 
Escaped.—5 12a? Dwarf.—5H12b! Very small white patch on forehead.--5H14a! Few white 
ventral hairs.—5H 15a? Tip of tail pink.—5 15a? Tail lost.—5 715? Small round belly patch.— 
5H19b! Few white hairs on forehead.—5H19b° Very few white hairs on forehead.—5 H2902 
Very small white patch.—5 306° Extremely pale (nearly white). Ought to have a new colour 
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category.—5H30b! Normal lilac.—5 316°, 5H31b! Very small ventral group of white hairs.— 
5H31b§ Few white hairs on belly.—5H326! Poor black, but no white patch ventrally: small 
white patch on forehead.—5H32b! Scattered white ventral hairs.—5H35a! Escaped.—5H37a4 
Belly very pale but not white.—5H37b? Pale.—5H40a! Belly pale but not white—5H40a% 
Escaped.—5H40b! Trace of white ventral group of hairs.—5H52a* Escaped.-5H54a! Very 
pale.—5H54b! Very small ventral streak.—5H62a! A faint streak of white on belly.—5H82a4, 
5H82a°, 5H82a° A few white hairs on belly. 


-6H Matings. 


6Hla* White blaze on nose.—6H7b° A small patch of white hairs on belly.—6H11a? Small 
white patch on back.—6H15c*, 6H17c”, 6H17c°, 6H18a®, 6H25a1 A few white hairs on belly.— 
6H27 Fawn ancestry.—6H28 Black ancestry.—6H280? A few white hairs on belly.—6H28c® No 
record except skin.--6H29, 6390 Black ancestry.—6H23a', 6H34a!, 6H34a?, 6H35a°, 6 H35a!, 
6/135a° Very pale. 
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and Coupling Hooks with Experimental | 


Comparison with Existing Theory. By 
E. S. Anprews, B.Sc. Eng., assisted by 


Karu Pearson, F.R.S. Issued. Price 3s.net. | 


On some Disregarded Points in the 
Stability of Masonry Dams. By L. W. 
ATCHERLEY, assisted by Karu Prarson, 
F.R.S. Tssued. Price 3s. 6d. net. 


On the Graphics of Metal Arches | 


with special reference to the Relative 


Strength of Two-pivoted, Three-pivoted | 


and Built-in-Metal Arches. 
ATCHERLEY and Kari PEARSON, 
Tssued. Price 5s. net. 


By, lise We 
F.R.S. 


| IV. On Torsional Vibrations in Axles 
and Shafting. By Karn Pearson, F.R.S. 
Issued. Price 6s, net. 

V. An Experimental Study of the 
Stresses in Masonry Dams. By Karu 
Prarson, F.R.S., and A. F. CAMPBELL 
PoLuLARD, assisted by C. W. WHEEN, B.Sc. 
Eng., and L. F. Rrcwarpson, B.A. Issued. 
Price 7s. net. 

VI. Ona Practical Theory of Elliptic and 

Pseudo-elliptic Arches, with special refer- 

ence to the ideal Masonry Arch. By Karu 

| Prarson, F.R.S., W. D. Reynornps, B.Sc. 
Eng., and W..F. Stanton, B.Sc. Eng. 
Issued. Price 4s, net. 


Complete sets may be purchased at 25s. net. 


Drapers’ Company Research Memoirs—(cont.). 
Questions of the Day and of the Fray. 


The Influence of Parental Alcoholism 
on the Physique and Ability of the Off- 
spring. A Reply to the Cambridge Econo- 
mists. By Karn Pearson, F.R.S. Price 
ls. net. 


II. Mental Defect, Mal-Nutrition, and 


the Teacher’s Appreciation of Intelligence. 
A Reply to Criticisms of the Memoir on 
‘The Influence of Defective Physique and 
Unfavourable Home Environment on the 
Intelligence of School Children.’ By Davip 
Heron, D.Sc. Price 1s. net. 


III. An Attempt to correct some of the 


IX. Mendelism and the Problem of Mental Defect. 


IT. 


III. 


IV. 


WAL 


VII. 


VIII. 


IX. 


Misstatements made by Sir Victor Hors- 
LEY, F.R.S., F.R.C.S., and Mary D. SturGsE, 
M.D., in their Criticisms of the Memoir : 
‘A First Study of the Influence of Parental 
Alcoholism,’ &. By Karn PEARSON, F.R.S. 
Price 1s. net. 


IV. The Fight against Tuberculosis and 
the Death-rate from Phthisis. By Karu 
Pearson, F.R.S. Price 1s. net. 
V. Social Problems: Their Treatment, 
Past, Present and Future. By Karu 
Pearson, F.R.S. Price 1s. net. 
Eugenics and Public Health. Lecture 
to the York Congress of the Royal Sanitary 
Institute. By Kart Pearson, F.R.S. Price 
ls. net. 
VII. Mendelismand the Problem of Mental 
Defect. I. A Criticism of Recent American 
Work. By Davip Heron, D.Sc. (Double 
Number.) Price 2s. net. 

VIII. Mendelism andthe Problem of Mental 
Defect. Il. The Continuity of Mental 
Defect. By Kart Pearson, F.R.S., and 


VI. 


Gustav A.J AEDERHOLM, Ph.D. Price1s. net. 
III. On the Graduated Character of 


Mental Defect, and on the need for standardizing Judgments as to the Grade of Social Inefficiency 


which shall involve Segregation. 


By Karu Pearson, F.R.S. 


(Double Number.) Price 2s. net. 


Eugenics Laboratory Publications 


MEMOIR 


The Inheritance of Ability. By Epcar 
ScuusteEr, D.Sc., Formerly Galton Research 
Fellow, and ErHen, M. Evprrron, Galton 
Scholar. Price 4s. net. 

A First Study of the Statistics of 
Insanity and the Inheritance of the Insane 
Diathesis. By Davip Heroy, D.Sc., Form- 
erly Galton Research Fellow. Price 3s. net. 

The Promise of Youth and the 
Performance of Manhood. By EpGar 
ScuusteR, D.Sc., Formerly Galton Research 
Fellow. Price 2s. 6d. net. 

On the Measure of the Resemblance 
of First Cousins. By Eraet M. ELpERToN, 
Galton Research Fellow, assisted by Karu 
Prarson, F.R.S. Price 3s. 6d. net. 

A First Study of the Inheritance of 
Vision and of the Relative Influence of 
Heredity and Environment on Sight. By 
Amy Barrineaton and Karu PEARSON, 
F.R.S. Price 4s. net. 

Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Parts I and II 
(double part). (Diabetes insipidus, Split- 
Foot, Polydactylism, Brachydactylism, 
Tuberculosis, Deaf-Mutism, and Legal 
Ability.) Price 14s. net. 

The Influence of Parental Occupation 
and Home Conditions on the Physique of 
the Offspring. By Erne, M. ELpErton, 
Galton Research Fellow. Shortly. 

The Influence of Unfavourable Home 
Environment and Defective Physique on 
the Intelligence of School Children. By 
Davip Heron, M.A.,, D.Sc., Formerly Galton 
Research Fellow. Price 4s. net. Sold only 
with complete sets. 

The Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Part HT. 
(Angioneurotic Oedema, Hermaphroditism, 
Deaf-Mutism, Insanity, Commercial Abili- 
ty.) Price 6s. net. 


SERIES. 


xX. #$TheInfluence of Parental Alcoholism 
on the Physique and Intelligence of the 
Offspring. By Eraen M. Etprrron, as- 
sisted by Kart Pearson. Second Edition. 
Price 4s. net. 

The Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Part IV. 
(Cleft Palate, Hare-Lip, Deaf-Mutism, and 
Congenital Cataract.) Price 10s. net. 

The Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Parts V and 
VI. (Haemophilia.) Price 15s. net. 

A Second Study of the Influence of 
Parental Alcoholism on the Physique and 
Intelligence of the Offspring. By Karn 
Pearson, F.R.S., and Ere, M. ELDERTON. 
Price 4s. net. 

A Preliminary Study of Hxtreme 
Alcoholism in Adults. By Amy Barrine- 
Ton and Karu Pearson, F.R.S., assisted 
by Davin Heron, D.Sc. Price 4s. net. 


The Treasury of Human Inheritance. 
Dwarfism, with 49 Plates of Illustrations 
and 8 Plates of Pedigrees. Price 15s. net. 


The Treasury of Human Inheritance. 
Prefatory matter and indices to Vol. I. 
With Frontispiece Portraits of Sir Francis 
Galton and Ancestry. Price 3s. net. 


A Second Study of Extreme Alco- 
holism in Adults. With special reference 
to the Home-Office Inebriate Reformatory 
data. By Davip Hrron, D.Sc. Price 5s. net. 

On the Correlation of Fertility with 
Social Value. A Cooperative Study. 
Price 6s. net. 

XIX—XX. Report on the English Birthrate. 

Part I. England, North of the Humber. 

By Erne, M. EnpErton, Galton Research 

Fellow. Price 9s. net. 


XI. 


XII. 


XIII. 


XIV. 


XV. 


XVI. 


XVII. 


XVIII. 


Vol. I of The Treasury of Human Inheritance (VI+IX+X1I+XII+XV+XVI of the 


above memoirs) may now be obtained bound in buckram, price 52s. 6d. net. Buckram cases for 
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binding can be purchased at 2s. 9d. with impress of the bust of Sir Francis Galton. 


A photo- 


graph (11” x 13”) of Sir Francis Galton by the late Mr Dew Smith can be obtained by sending a 
postal order for 10s. 6d. to the Secretary to the Laboratory, University College, London, W.C 


LECTURE SERIES. Price Is. net each (Nos. III and X excepted). 


I. 


II. 


ITI. 


IV. 


The Scope and Importance to the V. ‘The Problem of Practical Hugenics. 
State of the Science of National Eugenics. By Karu Prarsoy, F.R.S. Second Edition. 
By Karu Pearson, F.R.S. Third Edition. VI. Nature and Nurture, the Problem of 

The Groundwork of Hugenics. By Kari the Future. By Karu Pearson, F.R.S. 
Pearson, F.R.S. Second Edition. | Second Edition. 

The Relative Strength of Nurtureand | VII. The Academic Aspect of the Science 
Nature. Much enlarged Second Edition. | of National Eugenics. By Kari Prarson, 
Part I. The Relative Strength of Nurture | F.R.S. 
and Nature. (Second Edition revised.) By | VIII. Tuberculosis, Heredity and Environ- 
Erxet M. Enpertron. Part lI. Some Recent | ment. By Kari PrEArRsoN, F.R.S. 
Misinterpretations of the Problem of Nurture IX. Darwinism, Medical Progress and HEu- 
and Nature. (First Issue.) By Karu PEaArR- | genics. The Cavendish Lecture, 1912. By 
son, F.R.S. Kart Pearson, F.R.S. 

On the Marriage of First Cousins. By | xX. ‘The Handicapping of the First-born. 


EruHet M. ELDERTON. 


By Karu Pearson, F.R.S. Price 2s. net. 


The following work prepared in the Biometric Laboratory can be 
obtained as a Government Report from Messrs Wyman and Sons, Ltd. 


The English Convict, A Statistical Study. By Caries Gorrine, M.D. 


Text. Price 9s. Tables of Measurements (printed by Convict-Labour). Price 5s. 


THE UNIVERSITY OF CHICAGO PRESS 


THE EVOLUTION OF SEX IN PLANTS. (The University of Chicago 
Science Series.) By JoHN MERLE CouLTeEr, Head of the Department of 
Botany in the University of Chicago. 


Pp. vui+ 140. Small 12mo, cloth. 4s. net. 


In this first volume of the new “University of Chicago Science Series” Professor Coulter. the editor 
of the Botanical Gazette and the author of numerous volumes of botanical science, has given a presentation 
of the results of research showing that all reproduction is the same in its essential features and methods 
of reproduction are natual responses to the varying conditions encountered by plants in their life 
histories. Sex reproduction, the author says, is simply one kind of response, the sex feature not being 
essential to reproduction, but securing something in connection with the process. Various phases of 
the subject discussed include the evolution of sex organs, the alternation of generations, the differentiation 
of sexual individuals, and parthenogenesis. The last chapter, which offers a theory of sex, reviews 
the more prominent facts set forth in preceding parts of the volume, and serves both as a summary and 
a working hypothesis. 


ARTIFICIAL PARTHENOGENESIS AND FERTILIZATION. By 
JACQUES Loges, Member of the Rockefeller Institute for Medical Research. 


Pp. x + 312. 10s. net. 


The first complete treatment of the subject of artificial parthenogenesis in English. It gives an 
account of the various methods by which unfertilized eggs can be caused to develop by physico-chemical 
means, and the conclusions which can be drawn from them concerning the mechanism by which the 
spermatozoon induces development. Since the problem of fertilization is intimately connected with 
so many different problems of physiology and pathology, the bearing of the facts recorded and discussed 
in the book goes beyond the special problem indicated by the title. 


12mo, cloth. 


The University of Chicago Press 


The Cambridge University Press 
Agents for the British Empire 
London, Fetter Lane 


Journal of Anatomy and Physiology 


CONDUCTED BY 


SIR WILLIAM TURNER, K.C.B. ARTHUR THOMSON, University of Oxford. 
ALEX. MACALISTER, University of Cambridge ARTHUR KEITH, Royal College of Surgeons 
ARTHUR ROBINSON, University of Edinburgh 
VOL. L 


ANNUAL SUBSCRIPTION 21/- POST FREE 


CONTENTS OF PART I.—OCTOBER 1915 


.Professor Freprertc Woop Jones, D.Sc. The Homologies of the Chelonian and Mammalian 
Types of Genitalia. Davrip Waterston, M.D. Developmental Changes in the Pericardium, the 
Mesocardia, and the Pleural Sacs in the Human Embryo. F. J. F. Barrineton. The Mucinous 
Changes of the Vaginal Epithelium of Certain Mammals in Pregnancy. WatrEer E. CoLLines, 
M.S8c., F.L.S., ete. Some Abnormal Developments in the Vascular System of the Frog (Rana temporaria). 
Professor ARTHUR THOMSON. On the Presence of Genial Tubercles on the Mandible of Man, and their 
Suggested Association with the Faculty of Speech. J. Ernest Frazer, F.R.C.S., and R. H. Ropsrns, 
M.D. (Cantab). On the Factors concerned in causing Rotation of the Intestine in Man. 


LONDON: CHARLES GRIFFIN anp COMPANY, Lrp., Exeter Street, Strand 


JOURNAL OF THE ROYAL ANTHROPOLOGICAL INSTITUTE 
Vol. XLV. January—June, 1915 


Contents :— 


Minutes of the Annual General Meeting, January 26th. The Bronze Age Invaders of Britain. 
The Presidential Address. By A. Kurru, M.D., LL.D., F.R.S.  Bori Beliefs and Ceremonies. (With 
Plates land II.) By Major A. J. N. TRemMEARN®E, M.A., LL.M., M.Sc., Dip.Anth. Notes on the Initia- 
tion Ceremonies of the Koko, Papua.. By E. W. P. Curynery and W. N. Braver, F.R.G.S., F.R.A.I. 
Stone Implements from South African Gravels. By Major E. R. Conuins, D.8.0., and Rucinanp A. 
Smirn, F.S.A. Saxon Graveyard at East Shefford, Berks. (With Plates III—IX.) By Haronp 
Peake and E. A. Hooton. Beliefs and Tales of San Cristoval (Solomon Islands). By ©. E. Fox and 
F. H. Drew. 


185 PAGES, WITH 9 PLATES AND MANY ILLUSTRATIONS IN THE TEXT. 
PRICE 15s. NET 


LONDON: THE ROYAL ANTHROPOLOGICAL INSTITUTE, 50, Great Russell Street, W.C 
General Agent: FRANCIS EDWARDS, 83a, High Street, Marylebone, W. 


OR THROUGH ANY BOOKSELLER 


MAN 


A MONTHLY RECORD OF ANTHROPOLOGICAL SCIENCE 


Published under the direction of the Royal Anthropological Institute of Great Britain and Ireland, 

Each number of MAN consists of at least 16 Imp. 8vo. pages, with illustrations in the text 
together with one full-page plate; and includes Original Articles, Notes, and Correspondence; Reviews 
and Summaries; Reports of Meetings; and Descriptive Notices of the Acquisitions of Museums and 
Private Collections. . 

Price, 1s. Monthly or 10s. per Annum prepaid. 
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ROYAL ANTHROPOLOGICAL INSTITUTE, 50, Great Russell Street, W.C. 
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ON THE GENERAL THEORY OF MULTIPLE 
CONTINGENCY WITH SPECIAL REFERENCE 
TO PARTIAL CONTINGENCY. 


By KARL PEARSON, F.R.S. 


¥ (1) Let there be/ variates or characteristics A, B,C, ... L, each of these variates 
or characteristics being subdivided into categories A,, Ay, ... Aa, By, By, ... Bg, 
C,, Cg, ... Cy, ... Ly, Ly, ... Ly, where a, B, y,...A are arbitrary numbers. Then if 
N be the total population, and na,, na,, ... Naa the number of individuals in the 
A-categories ; Mp > Np,» ++» Nb those in the B-categories and so on, we have relations 


S47 (%q,,) = SyP (mp,) = Sy (M%,) = «= N. 


Further, if there be no relationship whatever between the variates or characteristics, 
we should anticipate that the frequency of the group A,, B,, Cy, ... Ly, in a sample 
of M would on the average be 
FA og 0S A A 
N a NN 

Actually we find in the sample M the number m,,,,..,, and the problem arises 
whether the system represented by mz, ..y 18 so improbable that in the selected 
population M the characteristics A, B, C, ... L cannot be considered independent, 
i.e. M is really not a random sample from the supposed population N. Clearly 
the answer to this problem has already been given. We have to find the value 
of x7: 


= Sun. Tie i ae (1), 
Ne WN. N 
and apply the tables for “ goodness of fit.” Of course in many cases the sampled 
population is not known and accordingly we can only put for x, Re: 7 the 
values given by the sample itself, i.e. a Fei G zi af , and test from this substitu- 


tion the degree of divergence from independence. If we take the mean value of 


x2, Le. 6% = x?/M, ¢? is termed the mean square contingency, and C, = V¢2/(1 + 4?) 
Biometrika x1 aC 
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gives a measure of the divergence from independence*. This is a multiple con- 
tingency coefficient. 


Another case not infrequently arises; the population N has the characteristics 
A, B, C,... not independent, but related; the cell wow... % contains Nypyy»,.y and 
the question arises how far it is safe to consider the population M as a sample 
of this population. In this caset 


m ut Nn 
UW. AT wow... W 
Nees TS) al .(11) 
UW... M ole se e(elele clelelvleisicleielsieleleele 
N Nurvw... 


In both these cases we have the relation 


Suow...¥ (oven ath) | / INR SRS 505 0050.40° (11), 


and accordingly the number of cell frequencies is one more than the number of 
independent variates. Thus in using the tablest of “goodness of fit” the n’ of 
the argument is afy ... A, but the value of P, the probability, has actually been 
determined from n’ — 1. 


(2) Now there are a number of cases in which not only do the cell-contents of 
the sample obey the linear relation (iii), but also other linear relations are imposed 
on the cell-contents. In the most general case we can suppose q linear relations 
between the cell-contents m,,,...y, and obtain the probability P corresponding to 
a value of x? limited by these q relations. The theory of sampling, when such 
conditions are introduced, I term the theory of partial contingency. The reason 
for this terminology will be clearer as we develop the theory. 


As far as we are concerned at present, it is of no importance whether we are 
dealing with one or other of our two cases, i.e. whether we are questioning the 
possibility of our material being a sample from a population with independent 
A, B, C,... L characteristics, i.e. determining a coefficient of mean squared 
contingency, or are investigating the possibility of its being a probable sample 
from a population with any associations between these characteristics. We can 
accordingly write y? in the form 
((Murw...y — Murw...v)”| 
( Muvw 
or for convenience we may even drop the descriptive subscripts and, numbering 
the cells in some sequence 1, 2, 3, ... s, ... (aBy ... A), write 


x2 - any W 


* Tf the characteristics may be assumed to be continuous variates, certain corrections for units of 
grouping can be made. ‘There is also a correction due to the necessarily positive value of ¢%. These 
corrections, which have been for some time in use, will be considered elsewhere. 

+ We must in this case of course actually know the value of nyyzy,,.y, it cannot be judged from the 
sample. 

{ For discussion of the deduction of P from x2: see Phil. Mag. Vol. u. p. 157 (July 1900); and for 
Tables: see Tables for Statisticians and Biometricians, Table XII (Cambridge University Press). 
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Here Myyy...y = ms will be either 


M 
M.-H eee N or WN uvwe we 


as the case may be. 
Now we shall find it convenient to write 


DRE C57 6 A ORE ee eee (vi), 
and thus - 
52s mat Sha 6 3G) teas Dinan Cesare ce Ree toner (vil). 


Further, if our q linear equations be of type 
et ala Mg to lenst to Weg g Ay 550 Ud gincainanascasdveses (viii), 


where f and H are constants and ¢ takes every value from:1 to qg, we can write 
our conditions in the form 


ee. + kinpXe + eee + ee. + sic IK 


where ae 
k his Viti 7 
ts Vi? My + h2 57s Tete ee Oe eel (1x). 
and ike A, — hym, — hyn, — ... —higm, — -.. 


Vim, + gM, + ... + h?,,, +... 
We shall speak of the first of (ix) as the prepared condition. Clearly it corresponds 
to a plane in n-dimensional space in which the constants ky, kj, ... kis, -.. are the 


direction-cosines and K, the perpendicular from the origin on the plane. It is 
convenient to use the notation 


kakey + Kiko -|- eee -|- Kiskus + -.. = COS (tt’) eee cer eee voc ene (ax), 
for (ét’) is now the angle between the tth and ¢’th planes. 


Assuming that the frequency surface with which we have to deal may be taken 
as 


we may suppose before applying equations of condition (ix) that K,, Ky,...K;,...K, 
are variates and that we eliminate X,, X,,...X,, expressing our x” in terms of 


Gis) 2 Kg, Xoaas Xgyo, --.%,-  Weshall have then 
z= % expt. — } (quadratic function of these m new variables). 


We now proceed to put K,, K,,... K, constant, but leave the other n—q 
quantities to vary; we are therefore seeking the value of y? for certain variates 
constant. This is the essence of partial contingency, and the analogy in con- 
tingency to partial correlation. 


(3) As a rule in partial contingency we do not seek to discuss xy? when single 
cells of our multiple contingency solid are constant in frequency, although our 
theory covers that case. What we require usually is the value of y2 when we make 
the contents of certain marginal total cells constant. For example, let us consider 


10—2 
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a population of uniform sex and let there be three characteristics: (i) frequency 
of age groups, (ii) frequency of occupational categories and (iii) frequency of survival 
and of death, the latter classified by various special disease classes—the whole, 
say, representing the returns for one year of a large area or country. We then 
require to determine, whether the like contingency solid for a sub-district, or for 
another population entirely, may be considered as significantly different from the 
above general population, i.e. we require to find the probability of its being a 
random sample of this general population. Now we may do this in the most 
universal manner, by assuming that not only survivals and deaths, but that age 
groups and occupations all have frequencies, which are random samples of the above 
general population. But this is not very often what we require; we admit that 
the age distribution is differentiated, we admit that the occupational frequencies 
are peculiar to the locality, and we ask whether, notwithstanding these differences, 
the death distribution is to be considered as a random sample. 


In other words, we do not only fix the size of our sample M; we fix all one 
face—that of age groups and occupational categories of our contingency solid— 
and ask what is the distribution of samples of M taken from this solid, subject 
to the linear conditions that the totals of age-occupational categories are constant. 
For example, if A be age and B occupation, we make Nq,p, Constant for all values 
of wand v, but 

Ui = Nayd, Cy ay Nab, Co a Naybd, Cs aipinsets 
where %q,,b,c, 18 the frequency of the wvs cell, c denoting the category f of type of 
death and survival. 


Now clearly in making this investigation we shall be studying the mean square 
contingency and the resulting probability of a partial sample—a sample of survival 
and death-type in a population of constant age groups and occupational classes. 
Again, we might treat a population as a sample with only constant age groups or 
only constant occupational frequencies and again investigate its probability as 
a sample with regard to deaths and occupations or with regard to deaths and age 
groups respectively. These would be partial contingencies of the order a or B, 
while the previous partial contingency was of the order a + f, a being the number 
of categories in A (i.e. age groups) and f being the number in B (i.e. mortality 
and survival classes). 

(4) Now the value of y? given above, and of the frequency surface (xi), was 
discussed by me in the year 1900* from the general normal frequency surface by 
evaluation of determinants. The demonstration depends on two hypotheses: 


(i) The approach of the binomial M (p + q)” to the normal curve 


. _M 
Vin Va 


We know that this is true if n be considerable and neither p nor q very small. 


eed 
e 2npq. 


* Phil. Mag. Vol. L. p. 157, 1900. Ihave recently given a more elementary proof with the probable 
error of P: see Phil. Mag. April, 1916. 
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The hypothesis is justified therefore if no cell be taken so small that its contents 
are very small compared with the size of the sample. 

(ii) That the sampling takes place out of a population indefinitely greater 
than the sample. If this be not true, then the distribution of frequency of any 
given cell for a series of samples follows not a binomial but a hypergeometrical 
series. The necessary modifications in the formulae are not very substantial and 


have been discussed elsewhere*. 
Supposing the above two conditions to be fulfilled, then in true random 
samples the mean of a cell frequency will be 


mi, = M5 id eee eT (xii), 


where M is size of sample, n, the contents of the sth ceil in the sampled population 
N, and condition (i) amounts to saying that no cell is to be chosen so that n,/N 
is indefinitely small. 


Further, the frequency of the sth cell will follow the binomial 


Ng TaN ae 
WtQ-H)} 
and thus have a standard deviation given by 


Se (1 *) BN techy Beas Gaul 


N N 


Lastly the correlation 7,, between deviations in the sth and s’th cells is given by 


On Fn "ss! = — Ni N ateletsiescierstafeleyevele oterejs\cieiersievsiele(ete 


(5) The following deduction of the value of x? is a variant from my Phil. Mag. 
proof. I owe the suggestion of it to Mr H. E. Soper, although I have deviated some- 
what from his track. Let an indefinitely large population N consist of the classes 
Cy, C, -..C, im the quantities nj, n,, Ng, ... n, respectively. Then p, = ,/N = 
chance of drawing a member of the class (,, and the standard deviation of the 
distribution of frequency in samples of M drawn from the population will in this 


class C, be as above Ye: 
Cp VEIT (yap enter re eee (xiii) bis, 


Further, the mean of samples for this class will be Mp, by (xii). 


In the next place the correlation between deviations from the means in classes 
C, and C;, will be in our present notation 


Gs Oeligg EM Wy ING cc cciudsdnctssceocssoctands- (xiv); 

te V De Ps! 
or by (xii)Pis a Xv). 
y (xi) e/a) (xv) 


* Phil. Mag. p. 239, 1899 and Biometrika, Vol. v. p. 174. 
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The distribution of frequency of the different classes in these samples of M will 
be given by the terms of the multinomial 


(po Co + pC, + pCa. 4p, Cpe eee (xvi), 
the general term being 
M! a ae Ps 
Riu aaa aaa Po Pr Pot = Pie CoC aC eee (xv), 


where Cy, C,, ... C, are only logical symbols to denote that this general term is 
the frequency of the group, where the class C, occurs u, times in the sample. 
Clearly uy + u44+...+u,= M. 

But (xvi) may be put into the form of a binomial, it is 


m=M M! aw ath 
* As m!(M — my)! (PoC) —™ (p1Cy + pola + ... + p,C1)™...(xvili). 


Let (py + pot... + pi) =A, then the mth term of the above series may be 
read as 


M! ; i i ; : 
m! (M — m)! (CoC Am (Py Of + Pe Ge + eee + Pi Cyn eee eee (xix), 
where Pi t+ pe t...+p/=1, 
Pi _ Pe Ps Pi 
and 65 = SS =, SS A Po ee ee 
Pi P2 Ps Pi Po Ge) 


Now it is clear that the factor 

(py Cy + po Cot ... + pi Ci)™ 
gives the frequency distribution of samples of m drawn from a population of 
indefinitely large size of which the proportions of the classes C,, Cy, ...C, are 
Pi, Po, --- Py and in which no class Cy occurs. But by (xx) these proportions are 
the same as in the original population which contains C). 


Hence if we take samples of M from an indefinitely large population with 
classes Cy, Cy, Cy, ...C,, those that contain m of the classes C,, C,,...C, will be 
distributed in the same proportions as if we had extracted m from an indefinitely 
large population consisting only of those / classes in the same proportions. 


Now thus far the nature of the class Cy is at our choice. In the original popula- 
tion N it appears with the total frequency m. Let n)+’ = N, and suppose 
n) is indefinitely greater than n’, then py will be indefinitely greater than 
1» Po» ++» Pn» It follows that p, if s be not zero is very small compared to 
unity, because 

Pot Pit Pot + Pn= I. 
Hence in such a system from (xiii)>is, if s be not zero, 
CS Vee ene (Esz))), 


and from (xv) to the same degree of approximation 7,,=0. That is to say, if 
py be large in taking samples of size M from an indefinitely large population, there 
will be no correlation in deviations in the frequency of the classes C,, ... Cy. 
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On the other hand 7, is not zero, but equals by (xv) 
—V psPolV1 — po = — V'p./V1 — Po 


to the same degree of approximation. Hence by (xx) 


But if we form the partial correlation of deviations in classes (, and C, for constant 
frequency of number in Cy, we have 
Osea lee — Venti) V (Li Tsp) (L197) 


= (0 —Vp. Vp: )/V (1 — ps) 1 = 9), 

V ps De 

o 1s = SS Se Pan non conn oonanonne 
: a V1—p/ V1 — p/ ( 


which agrees with (xv), the classes being now reduced by unity. Further, the 
reduced standard deviation must now be 


00s = O51 — To," 


Sen 
= V Mp, (1 — ps’) 
SBI oe (Clie Sh yes) has estas) co sive wees ess (xxiv). 


Now take the mean value x, of x, the frequency in class C, for a constant % or 
for constant frequency in class Cy; in our case, if the sample is to be m this will 
be M — m, we have 


os = o Ss 
oXs — Xs = Toe = (Xp — Xp), 


oo 
o@, — Mp, = We 5 (iM —m— Mp), 
os = Ds {ar = — i [M (1 — py) — ml 
Oz ots = - STUD a tee PAM Mice sade Gein vests (xxv). 


The partial values 97,, 9r,, of the means and correlations of classes for constant 
number in class Cy are given by (xxiil) and (xxv), and are what we might anticipate. 
But (xxiv) should be Vmp,' (1 — p,'). It is accordingly needful to take 


NV: 
or Oe Nee UME Sate chet cacitsSanesen ose sate weses (xxvl). 


These results have been reached on the assumption that py is very large as 
compared with p,, p.,...p,- It follows accordingly that the sample M must be 
large as compared with m, and further the sum of the classes C,, C3, ... C; must 
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be to that of the class C, in the total population in the ratio of the partial sample 
m to the total sample M. Without this condition it is not possible to replace 
Mp, by mp,’. Assuming these conditions to be satisfied, then samples of the 
size m in classes C,, C,, ... C, picked out of very large samples of M will reproduce 
the same distribution of frequencies in those classes as samples of m picked out of 
an indefinitely large population with the same relative frequencies in those classes. 


But in the case of samples of M, the deviations have their correlations zero 
for the classes C,, Cy, ...C,, or they will be approximately distributed by the 
product of their independent probabilities. The standard deviation being V mp,’ 
and the mean mp,’, we see that the frequency distribution would really follow 
a Poisson’s binomial limit, but as shown by L. Whitaker* this binomial limit is 
approximately Gaussian with fairly low values of mp,’; see the Diagrams for 
mp; = 10 and = 30 in the plate of her memoir. We may accordingly therefore 
take the distribution of the frequency in the sth class or cell to be given by 


_1 (m=)! 


2. 2 Ms 


= Jan V ia. (POEs SY bdomeeooonbousconDODODOC (xxvll), 


and the general distribution to be 


1 u ] 1 x? 
om ( =) ae er | 
V2) Vm py po’... py 


So hw.) ayia ont enegees (xxvull) 
where aoe aa =a | 
Here, if the size of the sample only be fixed, we shall have 
S (1) — i — 1S.(p,)i— sin), 
or S (m, — M;) = 0. 
m,— ™; 

If we take C= Ze ‘ 
we have: pate. hgh ey. Octet eames GI hes aorn See oadecousGce: (xxix) 
subject to the condition: 

N/ iy Xa WV ig Meg oe Ag Ones reee (xxx). 


It is clear that x? equal to a constant gives a sphere in J-fold space, and that 
(xxx) is a plane passing through its centre, and therefore cutting the sphere in /-fold 
space in a sphere of the same radius in (J — 1)-fold space. Hence if we desire to 
find the volume of the frequency surface (xxviii) which lies outside a value of 
X = No Subject to the condition (xxx), all. we have to do is to transfer to polar 


* Biometrika, Vol. x. p. 36. If m,=mp,’ be the mean, B,=1/m,=£,—3, so that B,=-03, B,=3-03 
already for m,=33. 
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coordinates and integrate the value of z for the (J — 1)-fold surface beyond the 
value y,*. Accordingly 


is the chance of a sample occurring with as great or greater deviation as the 
xo sample from the general population. This is the expression from which the 
Tables of ‘‘ Goodness of Fit” were calculated, the arguments being y,? and l,i.e. the 
value of x? for the sample and the total number of categories in the sample. 
Thus far there is only difference of method of deduction, not of results. 

(6) We now propose to replace condition (xxx) by a series of q linear equations 
of form (vii). These in the case of sampling will, if the size of the sample be 
fixed, either directly or indirectly involve (xxx). 

The type of these equations in their prepared form is 

kaX,thepXe+ s.. + hy X, +... Ky. 
Each such plane will intersect 
Get aa IGE iret at eee, Ghee ae 

in a sphere of lower order. For example, if there be » variates X, the first plane 
gives a sphere of the (n — 1)th order, this will be intersected by the second plane 
in a sphere of the (n — 2)th order, so that ultimately we find ourselves reduced to 
a sphere of the (nm — q)th order, by the intersection of the qth plane. If K,, 
K,, ... K, were all zero, the radius of the sphere of the (n — q)th order would be 
the same, i.e. x? as the radius of the sphere of the nth order. But since these 
quantities are usually not zero we have to determine the radius of this sphere. 
The centre of this sphere must lie in every one of the qg planes of the nth order, 
and accordingly on the plane of order n — (q — 1) in which they intersect. But 
the centre of the sphere of n — q order is where the perpendicular, K, from the 


origin meets this plane of the (n — q — 1)th order, and the radius y’ of the sphere 
of the (n — q)th order is given by xy’? = x? — K®. To determine y’ we must find P. 


Now P will be the minimum distance from the origin to the plane of the 


(n — gq — 1)th order in which the q planes intersect. In other words to find P we 
must make 
Dt= X74 X74 ..4+X2+..4X,2 


a minimum subject to the q conditions of type 
key Xy + hig Xot+...+hyXst ... thyX, = Ki, 
where Kay? + Keg? +... + hey? +... + hy? = 1. 
Using the method of indeterminate multipliers we find » equations of the form 


Xe Akay + Nghe, Fw. + Aghkes = 0. 


* Phil. Mag. Vol. u. p. 158. 
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Multiply by X, and add the series, and we find 


eT eet ee eee (xxxll). 
Multiply by &,, and add the series, and we find, by aid of (x), 
=k Ay + A, cos (12) + A cos (13) + ... + A, cos (1g). 


Similarly : 
— K,=A,cos(21)+ A, +A; cos (23) + ... + A, cos (29), 


CC i ee ee ei i ee ei ee ey 


— K, = i, cos (q1) + A, cos (g2) +... + ... +... +  AQe 
These are g equations to find A,, A,,... A,, and we can then substitute in (xxxil) 
to find the required P?. 


Now consider the determinant 
Ji =| “ily kK Kee TCE re he Bea (:9.0-21 0) 5 
Kee 1 cos (12), cos (13), ... cos (1q) 


and let us call the first row and the first column the 0 row and 0 column, then 


clearly 
—A,= — Rot/Roo; 


where R,, is the minor of the sth row and ¢th column, and 


= Kee, + K. Roo t+ eee + K,Ro: + eee + LG Tie 


Kk? é 
Roo 
De Roos 
or 1 Ga bares Cal byt eoP Gah cap nSnanoneeRDEsoG%0 (xxxiv). 


If we call A the minor Ro, we have: 
Ro: ee KA = KGIANs: = ieielehs aaa KOA: nite fe lene 
7 Att 7 Avy 
Thus KR=8s (x 2 =) +28 (KK : =) Rene: (xxxv). 


From this we deduce that the probability of a sample which gives x? = X02 
with q linear conditions must be obtained from 


i oc EP yn dx 
Ye a SeesO OB 960009 (XXXVl). 


i See ty no dy 


0 
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We must therefore in order to find P enter the Tables of ‘Goodness of Fit”’ 
with x? = y,?— K? and with n’=n—q+1*. 


The reader who is familiar with the theory of the multiple total and partial 
correlation coefficients will note how closely analogous the formulae (xxxili) to 
(xxxv) are to results in that theory. The fundamental determinant of the 
correlation may be made to agree with A, if we merely write 7,, = cos (ss’). 
In fact both theories really reduce to the discussion of the formulae of spherical 
trigonometry in multiple space. 

(7) A simple illustration of the above formulae may be taken from the case 
of the distribution of mortality in two districts, where the problem is to ascertain 
the probability that the difference of mortality observed allowing for the frequency 
of the age groups could be due to random sampling. 


Let the population sampled be represented by 


(a) ND ada cee: Dale ts D,| A 
A ee polit | aa eae DEN 
Veale ie hates a ee Ae 


where D, are the dead, L, the living and A, the exposed to risk in the sth age group, 
there being wu such groups in the total population P, of whom in the given period 
A die and A survive. 


Let the districts be represented by 


(8) Chall GA Ne al ace Geers d,, | 
(Eg Aa eal meee Ao ore see Ihe 
Gy NWA gn| Gage Been ee Gaerne: a, | Pp 
and 
(y) Cnaa i scalladnt Nemec. de le eeceas a, | oO 
Pian lec lcbat Pile esac Te aceite a ie. 
Dae Age \ One Wr atest WiGie tens Gi, Wap: 


respectively. Then the problem is to ascertain the probability that the last two 
distributions could both be samples of the same first population. 


The general formula has been given by me}; we have to evaluate 


fi ten? 
Gad 


(6+5)5 
(0 eye 


* The tables are constructed for n’-1 independent variables; in our case there are n—q such 
variables, hence n’-l=n-q. 
{ Biometrika, Vol. vu. p. 252. 


es 
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where f, must take the value of every frequency in the cells of (8), f; the corre- 
sponding cell frequency for (y), and F, that for (2). We have to seek for P under 
the argument n’ = 2u in our tables, if we have no restrictions on our variates. 
Actually we have such restrictions, for we are going to seek the partial x? when we 
suppose the age groups in each sample to be constant. Now 
d, Ele G4 and edema 

Thus it follows that 

OOS AE ee) He 

sieioes dia aipciiantes acini ae (xxxvii). 
POP) sy eee gee 
We have accordingly w equations of condition or n’ as argument will be 


reduced to u ai 1. Now we take 
C =) 
pp’ P pl 


a = , == ’ 
pe en DEP 
(6 i) 
Vf See eae. 
i pt+p VLJP ye 
Thus x2 1S" (X,*) + Sy% (Ye); 


with uw conditions of form 


<(DYP X ei/efe aen e 
aerbt 
or in the prepared form 


hy | Os 
oes oe y.- nmin, yr) 
a ea aes 
ahve ie = «| is “s) / Ea A, 
pt Pp G v) 


Further, all the cosines like’ cos (ss’) are zero, for no equations of condition 
involve the same variates*. Thus 


Ke Ke + Kke4 + K,? 
'\ 2 
= =e So ir ie =) RENE re tcihs Sm (xxxviil). 
Accordingly 
; Bide adie IP ly Ne Pa, eae 
2— y2_ K2 — pp’ | u (“- u s =| u Suan yf. 
Me = pee 1 9D aan ae i) ee Ae P 
- lL,’ 
We shall now substitute from the relation (xxxvii), getting rid of oe 


* This will necessarily be true if our equations of condition refer to parallel rows or columns, not if 
they refer to certain rows and columns. 
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oe Pp’ PA, @ as eye @ iN (2 w) PD, (2 a 
; Pa, ane p ery Pip) Aap pS 


lee ae | 
=e oa Pp’ mG py ; 


D, L, : 
Let Ae Bg q;, then we may write 
ny Jue @ pet Sadat Pits 9) 
eee >. Aupey, Sp p J 


Now if we know the population sampled, we have only to insert the values 
of P, A,, and p,, q, to obtain the value of y,?._ But if we do not, and this is usually 
the case, then our problem is: Are the two districts samples of the same unknown 
general population? For example, we might enquire whether the death distribu- 
tions in Bradford and Leeds were, correcting for age, significantly different. It 
might be supposed that it would be correct to give A,/P, p, and q,, the values 
found for all England and Waies. But it may be doubted whether this would 
be satisfactory. The populations of Bradford and Leeds might fairly be considered 
as samples of a general population which is very far indeed from being that of all 
England. Accordingly it seems much more reasonable to suppose that they are 
samples of a population whose mortality characters will be best represented by 
the combination of those of the two districts themselves, or we take 


d, + d,’ Ay dan @,. 
Gea P pt+p- 


Dos 


Substituting the first of these relations we find 


a; As (as ay 
= =e aa, (——-—-— ' 
Xo” is £ Ma zs . (xxxix) 
i aaa | a, + as A, ; d, aS a Ganda 
£S/\ d+ (1-2) 
p a Pp as as 
If we substitute the second relation, (xxxix) becomes 
as a, , d, d,’ Z 
pp (ee) 
TE a ae ()} 
(Gt ee\" a+ a) (1-2 4“) 
PAP a; + As 


and this, I take it, is the best measure of significance in the difference of the 
death distributions allowing for age groups in the two districts. 


It is clear that the first factor will in many cases differ but little from unity. 
For we should anticipate that approximately 


A 


= 2 gay 
as alee a, P 


Ie 
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Hence it would follow that 

a, a,” (3) (eee)! 

PDE NE [i a 
or the first factor is approximately unity. The value of y,? then becomes 


( a,a,’ (2 — &) 


a, as 
Q? = 8," { a+ ay(1 225)! BeRee ana chon sa: (xh), 
| ae a, + a) 


a quantity which I have shown elsewhere* has an important relation to the question 
of the significant difference of corrected death rates. 

It is obvious that the present method is of wide generality and I propose to 
illustrate it later by considering whether the general death distributions of various 
national groups allowing for age distribution, occupation distribution and character 
of mortality are or are not significantly different. 


Meanwhile (xxxix) provides us with a formula which enables us readily in the 
case of districts, special diseases or class groups to assert whether mortality 
experiences corrected for age distribution are or are not significantly different. 
As far as I can see (xxxix) and its extensions to occupation groups provide the 
proper means of ascertaining whether the populations at risk in various insurance 
offices or friendly societies are or are not materially different in character. It 
should be a guide to the actuary also as to which classes of the population it 
is most desirable to cater for. 


I have to thank my friends Mr H. EH. Soper and Mr A. W. Young for 
suggestions and help at several points. In the following paper numerical 
illustrations of (xl) are provided. 


* See Biometrika, Vol. x1. p. 164. 


ON CRITERIA FOR THE EXISTENCE OF 
DIFFERENTIAL DEATHRATES. 


By KARL PEARSON, F.R.S. ano J. F. TOCHER, D.Sc. 


(1) To determine whether the general deathrates or the deathrates for special 
diseases in two towns or in two classes of the general population are significantly 
different is a problem of very great importance. It is now generally recognised 
that crude deathrates are of little service for this purpose, the age distribution 
in the two districts or in the two classes of the population may be widely different, 
and the general deathrate or the special deathrate, i.e. the deathrate for a special 
disease, usually is a marked function of age*. The deathrate is therefore corrected 
by reduction to a “standard population.” We ask what would be the deathrate 
in the given district or class if its age distribution were constituted in a given 
or “standard” manner. This deathrate is spoken of as the “corrected deathrate.” 
Now we usually suppose the deathrate to be subject to “probable error.” In 
other words in a population of size n, if p be the chance of a person dying in the 
year, q the chance of a person surviving, then the standard deviation of the number 
of deaths is Vnpq, or the probable error of the corresponding deathrate m is 
67449, — 
CA 5. {Mie 


chances p’ and q’, we are tempted to compare without very full consideration 


If there be a second town of deathrate m’ and population n’, and 


m’ — m with -67449 nl fy + =e to obtain a measure of significant differentiation 


probably using a table of the probability integral. This method is for several 
reasons unreliable and fallacious. In the first place if it be applied to the crude 
deathrates, p and p’ can hardly be applied to the individual, they are so markedly 
a function of age. In the next place for many diseases, or for many ages in 
a general deathrate, p and p’ will be very small; accordingly there will be no 
approach to Gaussian distribution, but the binomials will approach Poisson’s 
Exponential Limit, in which case the meaning of the standard deviation of a 
difference requires much further consideration and probabilities will not be given 
by a table of the probability integral. 


* It is also well known to be a marked function of class. This is customarily disregarded in a 
comparison of local deathrates. But to test real sanitary efficiency a standardised class population 
may be as important as a standardised age population. 
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But the method outlined above has this importance: it suggests that the 
deathrate obtained is only a “sample” deathrate, and subject to the variations of 
sampling; thus it forces the problem upon us in a very definite form: Can two 
populations dying in a known manner during a given period be considered as 
samples drawn at random from the same material? Here again the age distribution 
difficulty arises, for by hypothesis we admit the age distributions of our two samples 
are not the same. Let us fix our attention for a time on a fairly narrow age group, 
say that d, deaths occur in an age group of size a, in a certain population and that 
the chance of death in the age group is p, and the chance of survival q, in the 
population out of which the sample is supposed to be drawn. Then undoubtedly 
the standard deviation of samples would be Va,p,q,, but as before the distribution 
would hardly be Gaussian. Now let us suppose the standard population, size A, 
to consist of the age groups A,, A,,... A,...., then the “corrected” deathrate 
M will be given by 


d, A, 
M=S (= =) Jian a (i), 
and if 6 denote a variation due to random sampling, we shall have 
&.d, A, 
3.M=8 (“= 4). 


Now speaking generally we do not “draw” our deaths in such a manner that 
their total remains constant. A shot so to speak fired at one age group is not 
to be supposed if it misses that group to have a chance of hitting a second age 
group. There will thus not be any of the usual negative correlation between a 
variation ind, and oneind,y. Of course epidemics which in a given period attacked 
individuals in certain age groups only might show a positive correlation between 
dd, and dd,. But in general it will be sufficient to suppose the variations of the 
d,’s independent, and measured by the probability of death in each group. Thus 
we should have 


Similarly if there be another population with age groups a’, and corrected deathrate 
M', then 


2 
and oy’? = S (Pte 4), 


We do not write p’, and q’, in this population, because we are supposing both 
to be samples from one and the same population. Now clearly 


M’-M=S (3 is “) 4 i ca 


Cae ayeA 
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and there will be no reason for supposing any correlation between d’, andd,. Thus 
it will follow that the standard deviation of M’ — M is Voz? + oy7?. Hence: 
ele Ac? 
Ow_y— Ss {p. Is (-. ar =) A EEA te AMR Te ns (iv). 
Now assuming p, and q, for the moment to be known, can we learn more from 
the relative values of M’ — M and oyw-_y than we thought possible from the 
distribution of the deaths in a single age group owing to the latter’s non-Gaussian 
form ? 
It seems probable that we can for the following reasons. Let z be the sum of 
u variates 2; + %,+... + 2,, these variates following arbitrary laws of frequency 
and being in no way correlated together, i.e. z is to be found by taking a random 
selection of each of our u-variates and adding them together, then from 
i OO) ae HOS asd ae Lp 

we can find the moments of z. Obviously we can measure all variates from their 
means, and accordingly we find: 

obg = S (afta) = UX ofl, 

elg = S (af) = UX ahs, 

eta = 8 (aft) + 6S (afte - oft’2), 

E “({u—1), - 
=UX aha a 6 oa a (fz), 


where ,f42, »#3 and ,u“, denote the mean values of the moment coefficients for the 
various z-distributions. Accordingly if B, and B, be the f-coefficients for 2: 


Ba ui? ae sfoversVoretetelaterorerolcletercjeleierotete(cleisic ole eisieievensic (v), 
wen i 
B, ee ay 
: ee: 
By = Se D) Palnancnnasvctictl Os case eee ostatiw tad (vi) 


where f, and £, denote f-coefficients found from the mean moments. Now in the 
case of the Poisson’s Exponential Limit to the Binomial we have* 


Pi=B2—3 ae 


np m 
where m is the mean number of deaths in the age groups. Hence, if we dealt with 
a fairly large population, where the number of deaths in any group were say 5 to 
10 and we made 10 age groups, we could reckon on B, and B, — 3 being of the 
order -02 to -01, or the distribution would be closely Gaussian. Thus there need 
be small hesitation in applying the tables of the probability integral to the 
investigation of the relationship of M’ — M to ojp_y. 


* Biometrika, Vol. v. p. 353 and Vol. x. p. 39. 
Biometrika x1 1] 
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(2) Two points, however, arise in this work. We do not know p, and q, 
nor have we yet selected the standard population, i.e. the values of ratios 
like A,/A. 

With regard to p, it may be held by some that the value obtaining in the 
general population should be given to it. This might be reasonable if that popula- 
tion were immensely large as compared with either group under consideration, 
but very often the group dealt with is quite considerable as compared to the 
remainder. Thus in Scotland for certain administrative purposes we consider 
Scotland (Clyde), and Scotland (excluding Clyde). In England for similar purposes 
we find nine official districts* selected, so that if we were comparing the North- 
western district with the London district, it would be curiously difficult to demonstrate 
why if these are samples, we take them plus the remainder to be more representative 
and fixed than either alone. We doubt very much whether the material we 
suppose we are sampling is to be considered as the general population. Rather 
we look upon that general population as the indefinitely large group who might 
be considered as living and dying under the same environmental conditions, if 
they continued indefinitely in force. 


Again, is it quite correct to take p, from the general population of the country 
when the problem is to discover whether the two districts are themselves random 
samples from some population which may not be the same as that of the general 
population of the country? Thus Aberdeen and Inverness might both well be 
random samples of a population which is not that of Scotland as a wholet. Hence 
as in cases of probable error it will usually be best to calculate p, from the observed 
material itself, i.e. if our two groups be really samples of the same population, 
then probably the best thing we can do is to take 
ite and q,=1-—— me 

a,ta@, asta, 


If p, be small, as it usually is, then it will be sufficient to take 


d,+d'.\ Ae 
ey-u=S \(‘ 7) art 


It remains to consider A,/A. Here again it is not unusual to take A,/A as 
given by the general population. Or if the “corrected deathrates” for a series 
of years are being compared, it is not unusual to reduce the age distributions to 
that of a certain year, e.g. in the manner of the English Registrar-General for 
tuberculosis and cancer deathrates to the population of 1901. For the same 
reasons as in the case just discussed we might, perhaps, find it fitting to take 
A,/A as given by the material under discussion, i.e. = (a,+’,)/(a+ a’). Or, 
again, we might reduce one district to the population of the other, i.e. take 
A,/A =a,/a, or a’,/a’. It is of interest to see practically what differences such 
divergent reductions make. 


d,+d, 


Ps 


* By the Board of Trade for example. 
+ Or again lawyers and the clergy may have or may not have significantly different mortalities, 
but the mortality of both differs essentially from that of all England. 
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Another solution of the-standard-population-problem, which seems to us of 
some importance, arises from the consideration that we ought to select our standard 
population so that the probability that the two districts or classes under considera- 
tion are samples of one and the same population should be a minimum. In other 
words we ought to select A,/A (= X,) so that 


Q=SAX)/VS &,X,), 
where A, = d',/a’, — d,/a,, 

Vs = PsYs(1/a’, + as), 
is a maximum, subject to the relation S (X,) = 1. 


Proceeding hy the usual rules for finding a max.-min. we have 


A, v,X5 
0=0* 8 (sax) ~ 5px) OX 
SOX) — 0. 
Therefore if P be an indeterminate multiplier: 
A, Vid, +P. 


S(A,X,) 8 (v,X,?) 
Multiply by X, and sum all such equations and we find P=0. Hence: 


A, S@,X,2) 
v5’ 5 (A,X) " 


~ Sum all such equations and we have 


Xs 


Ve) 
1=8 (7) Fax) 
Aes ah 
Thus we deduce: ia TA 
“8 (7) 
Vs 
whence SOROS & i iS ), 
r,2 d,\)2 
S (yeex) = S ()/Is (*)} ) 
and accordingly Q= yi S (*=) k 


A little consideration shows that this is a maximum value of Q. The argument 

we then use is that if on the standard population which provides a maximum for Q, 

there be no significance in the deathrate difference, there cannot be any significance 

at allin the difference. While on the other hand if on any population whatever 

used as standard, we do find a significant difference, such a difference really exists. 
11—2 
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Returning to our deathrate symbols we have 


di, d. 
A, en 1 ae 
73 ea > en (viii) 


@ ay (tee or very approximately to oe - a, 
s $s if oe a, 


We note at once that this method would give irrational values for A, in the 
case of any material for which the age deathrate was not invariably greater for one 
district or class. On the other hand Q the ratio of the difference of corrected 
deathrates to the standard deviation of that difference is always real and given by 


la’.a, e = d, 


2 
vole) approximatel 

deeds PP 5 

If for every value of s, d’,/a’, is either greater than d,/a,, or, on the other hand, 
is always less than d,/a,, then the odds are about 50 to 1 if Q be as great as 2 that 
there is significant divergence of the two corrected deathrates. On the other hand 
if Q has no significant magnitude, the deathrates will not be significantly different. 
Supposing the condition as to the relative magnitude of d’,/a’, and d,/a, be not 
satisfied, then if Q has no significant magnitude for these irrational age classes, 
it will certainly have no significant magnitude for any other size of age classes, 
and accordingly we conclude that the difference of the deathrates is not significant. 
But if Q be of significant magnitude for the irrational age classes, it does not follow 
that it will be significant for rational age classes, and further discussion is needful. 
Of course any case in which for the bulk of groups d’,/a’, is greater than d,/a,, 
but for one or two groups d,/a, is the greater, will have Q = (M’ — M)/oyw_m 
lessened by the inclusion of these cases, for the numerator of Q is decreased and 
the denominator increased. We should therefore be at liberty to consider only 
the groups where the deathrate goes one way, for the age groups are actually 
independent, and it is really a fictitious balancing of the corrected deathrates 
which arises, when one age class with deaths in excess compensates for another 
age class with deaths in defect, and so tends to equalise M and M’. This is of 
course a grave difficulty which must arise when we deal with any corrected 
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deathrate at all. Two such deathrates might show no significant difference 
although the aged were dying in one population and the young in the other in 
excess*. We might work only with the groups for which d’,/a’, —d,/a, is of 
the same sign, and this would indicate, by the same test, differentiation or its 
absence in the manner of dying; but of course we should then be dropping the 
idea of a “corrected deathrate’; that idea is, however, essentially imperfect and 
does not really distinguish effectually between differences in the manner of dying. 

(3) We now ask whether it is not feasible to interpret the value of @ in 
Eqn (ix) as a measure of the probability of a differential mortality without regard 
to the theory of “corrected deathrates.”’ 

As before let d, be the number of deaths in the age group of size a, in one 
district or class. We suppose this to be a sample of a population of which p, 
is the chance of dying in this age group, then if there be w age groups in each 
district or class, we have 2u deviations, all of which are independent, and given by 
d, — p,a, and d’,— p,a’,. These deviations have respectively standard deviations 
Va, Peq, and V/ a',p.qs- Accordingly if every deviation be measured in terms of 
its standard deviation we shall have a second moment coefficient given by 


re? = i s (G- Pets)') eS (see) 
Qu 4sPsVs a’ sss 


and X? ought to be unity. 


Now in the above expression p,, g, are the unknown values of the death and 
survival chances in the population from which the two districts or classes are 
supposed to be sampled, and the best values we probably can take for them are 
ps =1-—4q,=(d,+d’,)/(a,+’,). But inserting these values in the above 
expression, we find with the value of Q@ given in Eqn (ix): 


il i 
je 2 = 
w= 57 @, or BEG) a 


But the standard deviation of a second moment coefficient is V4, — po2|V n 
and this in the case of a normal distribution (uw = 3,2) equals Gf », U2 SO in our 


case, since = 1 and n = 2u, the standard deviation of L? equals 1/Vu. Thus 
we have to measure &? — | in terms of 1/Vu, or to ascertain the probability of 


a ratio of deviation to standard deviation of magnitude greater than (4Q? — u)/Vu. 
We see again therefore Q arising as a constant which naturally determines the 
mortality resemblance or difference of the two districts. But while in the 
previous approach to the solution of the problem from the corrected deathrates 
we found Q alone sufficed to obtain our criterion, we require in the case of 


* Another factor, sometimes overlooked when deathrate after correction is taken as a measure of 
local health, is emigration. If the sth age class tend to migrate from A to B, it is usually the healthy 
who migrate; thus the deathrate of the sth class in A will be inflated and that in B reduced. If the 
migration be chiefly that of males, as to mining districts and colonies, a spurious correlation between 
deathrate and sex ratio may be created. 


- 
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this second criterion to make use explicitly as well as implicitly of the number 


of age classes. We can thus give a physical meaning to Q, V2Q is the ratio of 
the standard deviation of the 2w age classes’ deathrates—each supposed to be 
drawn from the same population and measured in terms of its own standard 
deviation—to the standard deviation of this standard deviation, i.e. to 


ae . It by no means follows that this second criterion will give the same 


V2 x Qu 

result as may be drawn from the first. But this new aspect of @ frees us from 
many of the difficulties essentially associated with “corrected deathrates” and 
the indefinite category of a standard population. We must observe, however, 
that to evaluate the probability of the occurrence of Q we have again to justify 
the assumption that &? will follow a normal distribution. We could not justify 
this for the distribution of deaths in any single age group, nor even for the 
distribution of factors like (d, — p,as)?/(@spsqs) and (d’, — p,a’,)?/(a’,p.qs) summed 
for our two populations on one occasion, but we can do this for the distribution 
of &? on the basis of a number of random samples*. 


(4) We can again approach the problem by considering quantities like 


i} , 
and ¢ 2 Pas 


VasP4s Va' :Psds 


The mean of these deviations measured each in terms of its own standard 
deviation should be zero; and the standard deviation of this mean should be 


1 ; : ; : ‘ 
——, since there are 2u variates and, each being measured in terms of its own 
ZU 


standard deviation, the standard deviation of the series is as before unity. Thus if 
cola aL d,— pa d’, — psa 
w= Aol ca) nog te 
2u* | Vapds/ — \Va' Peds 


1 a 
m / (7; ) =V2u x m may be looked up in the tables of the probability integral, 


2u 


and the probability of the system, as a result of random sampling from a population 
Ps» Us, thus again determined. If we use the values of p, and q, so often adopted 
above, the quantity with which to enter the probability table, i.e. the ratio of 
deviation to standard deviation, is expressed by 


(! ) (Wa, — Vay) 


z= 1 GG 
VQ XT St eee x), 
“xm Vane We +d) Ca a [ (x) 
a, ) 


/ f} y < uy) 
where 1, = a, — d, and I’, = a’, — d’, are the survivors in the sth age groups. 


* We can show in a manner similar to that on p. 161 that the distribution of .’s approaches the 
normal when each of the constituent 2?’s is drawn from different populations, none of these populations 
being in themselves accurately normal. 
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(5) The previous methods are all more or less inadequate; they test whether 
a certain single character of the distribution does or does not present significant 
difference in the two populations compared. If we want to test the distributions 
as a whole, we must adopt a modification of the method given in Biometrika, 
Vol. vit. p. 250 for determining the probability that two systems of frequency are 
random samples of the same population. Jt has been shown in a memoir dealing 
with partial contingency* that the proper test is to determine 


A (@ = =) 
Getige (Oke)? Poe Ged, 
y 2 , y, 
(Gy SE) I tao Gaara (1 ees =) 


Xo = Si 


entering the Tables of Goodness of Fit with n’=s+1. Here p and p’ are the 
numbers in the sampled populations. Now the factor 
aa’, (p+ p')? 
(a,+a',)" pp’ 
is for most practical purposes unity, as we shall illustrate in the sequel. Hence for 


such purposes 
aa’ 22) 


hi 
Os M s 


: d, +d’, 
(hse) aan) 
of our earlier investigations. Thus to obtain the probability of the two districts 
being random samples of the same population all we have to do is to look up Q? 
for n’ = u-+ 1, wu being the number of age groups, in the Tables of Goodness of 
Fit. It will be seen that Q has a wide meaning, and generally speaking we may 
treat it as a constant, which can be used in a variety of criteria for testing the 
existence of differential deathrates. We propose to illustrate this in the following 
sections, applying in each case the four tests discussed above, namely : 

(a) The probable error of the “corrected” deathrates’ difference reduced to 
the standard population of maximum difference. 

(6) The significance of the means of the 2w deathrate classes. 

(c) The significance of the squared standard deviations of the 2u deathrate 
classes. 

(dq) The general y,? test of partial contingency with its approximate value Q?. 


The following illustrations have been used: 

(i) General Deathrates of Liverpool and Birmingham. 

(ii) Cancer Deathrates of Edinburgh and Dundee. 

(uii)—(vili) Cancer Deathrates for all England and Wales divided into four 
groups: (a) London, (b) County Boroughs other than London, (c) Urban Districts 
other than (a) and (6), (d) Rural Districts. 

(ix)—(xiv) Diabetes Deathrates precisely as in the preceding Cancer case. 


* Biometrika, Vol. x1. p. 157. 
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Illustrations. (i) We have for Birmingham and Liverpool the following data 
for all Males in 1911: 


Birmingham Liverpool 
Age Group sal ; : 
Population | Deaths Population Deaths 

0— 32,552 2003 45,889 3117 

b— 58,653 160 78,518 326 
15— 48,431 162 62,751 309 
25— 47,212 252 58,216 476 
35— 37,897 382 47,711 632 
15— 25,431 454 32,664 775 
65— 15,384 575 20,198 944 
65— 7,535 511 10,215 904 
75— 1,944 321 2,194 335 
85— 173 58 191 62 
Totals 275,212 4879 358,547 7880 


Our problem is to discover whether there is actual differentiation between these 
two systems of deaths, and if so, what is the measure of it. We work with ten 
age groups. 


The actual arithmetical work is indicated on the following page. It leads 


to Q? = 165-5031 or Y = 12-8648. Hence applying first test Ms 12-8648. 
Om'-M 
Or: the difference of the two deathrates corrected to the population of maximum 
ratio is no less than 12-86 times the standard deviation of the difference. We 
conclude that the chance of such a difference arising from random sampling is 
enormous*, or the two deathrates are most certainly and markedly different. 


There is no difficulty in correcting the deathrates to the population of maximum 
difference as standard, but it is of interest to note what happens, if the standard 
population has other values. 


For example, when we correct to the male population of all England and Wales 


for 1901, we find using the formula of p. 161 
aes 


If we use the general male population of England and Wales in 1911 we find 


Ae 10-1567. 


Om’-M 


* The chance is approximately 3-3508/10°*. This test is practically valid in this case for the general 
deathrate of Liverpool males is greater at all ages except 75 onwards than that of Birmingham and the 
age groups above 75 contribute nothing of importance to the value of Q2. 
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These values of course lead to the same conclusion but show that the ratio 
(M’ — M)/oy-m varies from standard population to standard population, and 
may be increased more than 20 % when we pass to the population of maximum 
difference as standard. Such an increase might be of considerable importance in 


our estimate of differentiation if the ratio (M’ — M)/oy-_y lay between 1-8 and 
2-2, say. 


Our third test is given by the formula on p. 165: 
(Q? — u)/Vu = 72-7515/V/10 = 23-01. 
The probability of a deviation as great or greater than this arising is immense. 


Thus we see that the distribution of the squares of the actual deviations is 
excessively improbable. 


Proceeding to our second test we find 
m = — 16625, 
: it : avec 
and accordingly 7 / s) = -745, or m does not differ significantly from zero. 
2u 
Thus by this test no essential difference would be indicated. This does not show 
that it does not exist, but only indicates the inadequacy of the test. In fact since 
d,— p,@,+d',— p,a’, = 0, 


and there is no great difference between Va,p.q, and Va',p.q;,m tends to be zero, 
even with considerable differences between d, and p,a, or d’, and p,a’,. 


We have seen that the value of Q? in the Liverpool and Birmingham case is 
165:5031. We will now investigate the value of the factors 


ppiiiptp 


for the ten age groups. They run 


Age Factor Age Factor 

| 0—5 98817 45—55 1-00182 
b—15 99626 5d—65 -99895 
15—25 | 1:00071 65—75 99440 
25—35 1-00652 75—85 1-01376 

| 35—45 1-00422 85 and over 1:01510 


It will be seen that the factors differ very little from unity and introducing them 
into the several terms of Q? we find 


X02 = 165-4695, 


or x,2 only differs by 0-02 °% from Q?. Applying the test for goodness of fit to 
xo? for n’ equal eleven groups* we find 


P = 2-40/1020, 


* See Tables for Statisticians, p. xxxiii. 
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or the odds against Liverpool and Birmingham general mortality experiences being 
samples of the same population are gigantic. This test which takes the distributions 
as wholes seems to us absolutely conclusive. The differences in deathrates in 
Liverpool and Birmingham are not due to difference of age constitutions, but are 
fundamental *. 


(ii) Cancer Deathrates in Edinburgh and Dundee, 1891-1900 (Males) +. 


Edinburgh Dundee 
Age Group -—— = a ee eee 
Population Deaths, Cancer Population Deaths, Cancer 
| 0—5 149,763 5 89,775 6 
d—15 280,655 8 168,510 2 
15—25 274,343 18 142,917 7 
25—385 214,063 47 98,953 10 
35—45 158,133 117 76,132 47 
| 45—55 115,206 295 59,066 114 
55—65 69,954 356 37,337 151 
65—75 32,966 266 16,958 109 
75—85 10,311 93 4,625 30 
&5 and over 1,045 8 535 3 
Totals 1,306,439 1213 694,808 479 


Proceeding as before we find Q? = 28-4837. The factors for the age classes 
to give x,” from the Q? terms for the separate age groups are given below. They 
differ more from unity, but some being in excess and some in defect, there is no 
substantial difference between y,? and @?. 

We have Xo" = 28-0393, 


OF, Xo? 1s 1-6 % less than @?. 


Age Group Factor: O60 s / (% or) Age Group Factor : it al / ( a) | 
Pp prp Ppp /\p+p 
0—5 1-03386 45—55 -98855 
5—15 1-03427 55—65 1-:00711 
15—25 -99360 65—75 -98966 
25—35 *95387 75—85 -94318 
35—45 ‘96788 85 and over 98812 


We now apply the same tests as before: we have 
(M’ — M)/ow-m = Y = 5:33870. 
Hence 4 (1 + a) = -99999,99527, 


* This only means that the differentiation cannot be accounted for by age differences, it might well 
be accounted for by class or occupation differences. 
{ The results are sums of ten years’ population found on assumption of arithmetical progression. 
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or, the chance of a deviation so great as this appearing is only 4-73/108, or the 
“corrected” cancer deathrates for males are significantly different in Dundee and 
Edinburgh. 


Using the General Population of Scotland, 1891-1901, as a standard population, 


we find 
(M’ — M)/ow-m = 4:7399. 


Using the General Population of England and Wales (1901, males) as a standard, 


we find 
(M’ — M)/oy:-y = 4:7630. 


These again both mark significant deviations in the corrected deathrates, but 
fail to give the maximum of significance. 


Now let us apply the test of distribution of squares of differences. We have 
($Q? — u) Vu = 4:24186/V10 
= 13414. 


Such a deviation would occur about once in ten trials and is not necessarily 
significant. 


Again applying the test of mean value, we have 


m= — +1564, 


and accordingly ml(s =) = -6994, 


and this is an insignificant ratio of the deviation to its standard deviation. 


We now turn to the x,” test, where we have yo? = 28-0393. The Tables of 
Goodness of Fit provide for »’ = eleven: 


P0018, 


or the odds are nearly 500 to 1 against such a deviation on random sampling. 
We conclude that there is a significant difference between cancer mortality in 
Dundee and Edinburgh. It is noteworthy that the corrected deathrates criterion 
which when analysed seems so very unsatisfactory gives here as in the case of the 
Liverpool and Birmingham General Deathrates far greater significance to the 
observed differences of mortality. 


We have not considered it worth while to investigate for this case the test 
of the significance of the mean of the 2w deathrates. It is we believe inadequate 
and further is laborious to calculate. It is, we hold, sufficient and more enlightening 
to calculate y,?, and, what is almost deduced in the same process, the quantity Q?. 


There is a further point which may be illustrated on the Liverpool-Birmingham 
and Dundee-Edinburgh data. We have supposed in the course of our work that 
(d, + d’,)/(a, + a’,) is an extremely reasonable value to give to p,. If we assume 
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that the distributions of quantities like d, are given with sufficient accuracy by 
the normal curve then the probability of the whole result observed is given by 


CE ae ie 1 @ani 
II = Product | LEE ] x — 2 PIs ] Vira )) 
V0; Ds Ys 1 @ sPs1s 1 
where V is the continued product of the differentials of the d,s and the d’,’s, 
and we require to make this a maximum for the variation of all quantities like p,. 
In other words we have u equations found by differentiating the above expression 
for p,, Po, --- Ps» ++» Pn (of course q, = 1 — p,) to determine the best values of these 
quantities. Taking the logarithmic differential of the above product with regard 


to p, and equating it to zero, we find since 69, = — Op,: 
1 1 | d, — psQs a Os Ds s 1 (d, = pss)” (d', = pa) 
a Ps zi Ys | Bad. ; sl As Ps? Is" Gap. (Ds —_ qs) 
Hence: 


d,+ d' ‘ it Ys — Ps {(: ak PsMs)” | (d', = Ds 5)” 


Be a,ta, 2,40) GPs a 9a, Pens} en 


, 


eter d,+d 
Now assuming to a first approximation p, = aaear: we find to a second 


approximation #, 


ee ay 
d,+@', 1-2pf, 1 * Fa, a) 


Ps a , t aay 
Ga, G+, 2; Gaeds (1 : a) 
\ a,+a’, 
GER ENG 
OE (“ ar 
Calling as before C2 - a ae: 
(d, a d ) (1 aa: (pee al 
i EG 
= O25 S 1Q)2 
* ihe (2 aa (1 = oor | (1 2Qs ) 
we have —————————————s 
a,t+a’, a,+ 0’, 
Hence we may write Ds Gesteenes On 
a,+ Qs 


this amounts to altering the deaths by 


ae (1 g tet ny G=10.), 


a,+a’, 


which is usually a very small number, or we may write 


. a+’, ae 1 2 
s a, a we i (1 = +Q; ) le ats d', a, zie male 
where the factor 
ll 2 
ae 192 Cae 
jE = 0) ee eel 


will only differ slightly from unity. 
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The following table gives the values of p,, p, for the various age groups in 
the case of (a) Liverpool and Birmingham for General Deathrates, (b) Edinburgh 
and Dundee, Cancer Deathrates, (c) Rural and Urban Districts, England and 
Wales, Cancer Deathrates. 


Illustrations of Approximate Deathrates for unknown sampled Populations. 


(a) Liverpool and (6) Edinburgh and (c) Rural and Urban 
Birmingham, General Dundee, Cancer Districts, England and 
Deathrates Deathrates Wales, Cancer Deathrates 
Age Group | 
| Ps Ds Ps Ds Ps | Ds 
| | 
| 0—5 -065,272 -065,332 -000,046 000,045 |) . aye 
Lo ons 003,550 | -003,611 | -000,022 | -000,021 |f 000025 | -000,025 
15—25 -004,236 -004,299 -000,060 -000,058 -000,033 -000,033 
25—35 ‘006,905 -007,039 -000, 182 -000, 187 -000,116 ‘000,116 
35—45 ‘011,845 -011,937 -000,700 -000,698 -000,404 -000,404 
45—55 -021,155 021,335 -002,347 -002,360 -001,500 -001,500 
55—65 -042,690 -042,905 -004,725 -004,726 -004,256 -004,256 
65—75 -079,718 -080,269 -007,511 -007,532 -007,787 -007,788 
7d—85 -158,531 -158,498 -008,235 -008,252 -009,653 -009,646 
85 and over -329,670 +328,756 -006,962 -006,405 -008,856 -008,847 


It will be seen that the corrective factor contains the inverses of the total 
number (d,+d’,) of deaths and the total numbers of individuals (a,+ q@’,) in 
the combined age groups s and s’. Hence for big districts as in (c) the 
corrective factor is of small importance even for special diseases. For the 
general deathrates in two large towns as in (a) the difference between p, and 
p; 1s as a rule less than 1%. Even in special diseases in towns of moderate 
size, it is only where the total number of deaths in the combined age groups 
s and s’ is very small that any substantial divergence between p, and @, 
arises, e.g. in (b) for the child or extreme old age groups. Thus the value of @? 
is hardly likely to be modified practically, if we replace ~, by p,. Accordingly 
ot ae besides being easy to calculate, is a reasonable approximation to the 
better value p,. Of course #, itself is only an approximation* to the “best value” 
and this “best value” also depends on the accuracy of replacing the binomial 
by a normal curve. Thus it is by no means certain that, if we obtained a true 
best value for the deathrate in the unknown sampled population, it would be 
markedly nearer to #, than to p,. We content ourselves by remarking that 
neither for practical nor theoretical reasons does there seem likelihood of a great 
gain resulting from taking any other value than (d, + d’,)/(a, + @’,) for p,. 


* We should have to solve a cubie for each age group to find the accurate “best value,” on the 
above hypothesis of normality. Doubts may also be raised as to the legitimacy of the theory which 
makes II in (xii) a maximum. They are discussed in another paper. 
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(i1)—(vill) Cancer Deathrates for all England and Wales. 

The object of the present illustrations is to ascertain whether there are 
significant differences in the cancer deathrates associated with urban and rural 
conditions. We divide the data into four groups: (a) London, (b) County 
Boroughs other than London, (c) Urban Districts other than County Boroughs, 
(d) Rural Districts. We compare pair and pair these four groups in order to 
ascertain the degree of their significant differences. 


The following data are taken from the Registrar-General’s 76th Annual Report 


and are for the year 1913*: 


Populations and Cancer Deaths in Age Groups, 1913. 


How far are these differences in the crude deathrates really significant, when 
what is the numerical measure 


allowance is made for age groups? 


of this significance in the six cases? 


Above all: 
The following table gives the deathrates 


per 100,000 at each age in the four groups: 


Populations 
b) County c) Urban d) Rural 
Age Group (a) London ibe  istriets istriote 
0—15 648,361 1,796,847 1,976,078 1,242,634 
15—25 383,601 1,006,039 1,126,179 716,323 
25—35 362,967 938,541 1,017,991 580,305 
3B5—45 293,427 766,070 836,306 496,185 
45—55 213,635 536,949 586,962 395,443 
55—65 131,423 331,209 369,840 276,814 
65—75 68,124 166,717 200,982 180,029 
75—85 18,830 43,125 60,407 65,453 
85 and over 2,239 4,493 7,361 9,125 
Cancer Deaths 
0—15 25 31 | 47 34 
15—25 24 39 34 Dil 
25—35 51 128 121 65 
35—45 130 416 318: 185 
45—55 472 1075 971 503 
0d—65 723 1700 1675 1077 
65—75 655 1312 1607 1360 
75—85 209 450 576 639 
85 and over 29 29 ie 783 
Populations 2,122,607 5,589,990 6,182,106 3,962,311 
Total deaths 2,318 5,180 5,457 3,963 
Crude death- 
rates per | 
100,000 109-21 92-67 88°37 100-02 


* See pp. 4, 217, 234, 253, and 271. 
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Male Cancer Statistics, England and Wales, 1913. 


b) Count c) Urban | d) Rural 
Age Group (2) London pore: Cees ae 
SS Re i= 
0—15 | 3°86 1:73 2-38 | 2:74 
15—25 6:26 3°88 3-02 3:77 
25—35 14-05 13-64 | 11-89 11-20 
35—45 | 44-30 54:30 42-21 37:28 
4555 | 220-94 200-21 165-43 127-20 
55—65 550-13 513-27 452-90 389-07 
65—75 961-48 786-96 | 799-57 755-43 
75—S85 | 1109-93 1043-48 953-53 976-27 
85 andover | 1295-22 645-45 | 991-71 800-00 
| : s 
Corrected | 
deathrates * 111-48 101-01 91-97 82-24 
Crude 
deathrates 109-21 92-67 88:27 100-02 
| 


In the first place we take the “corrected deathrates” reduced to the male 
population of 1913. We have: 


Pair M-M \ oye | (M-Miloy-y Q 
London and County Boroughs doo 10-47 2-666 3:93 6-96 
London and Urban Districts ... tio 19-51 2-522 | 7-74 8:95 
London and Rural Districts ... a 29-24 2-536 11-53 13-09 
County Boroughs and Urban Districts | 9-04 1-848 4-89 7-39 
County Boroughs and Rural Districts | 18-77 1-953 | 9-61 12-25 
Urban Districts and Rural Districts... 9-73 1-829 | 5:32 6-76 


To begin with it will be seen that there is a very considerable difference in the 
values of (M’ — M)/oy—~ and Q, or the reduction to the general population of 
males gives nothing like the same intensity of significance to the differences 
between the means as the reduction to the standard populations of maximum 
difference. We are unable to determine what is the standard population of real 
maximum difference in any casey}, and this very fact seems to discredit the use 
of the deathrates corrected to an arbitrary standard population as a means of 
adequately testing differences in mortality. For, although in this case all the 
differences of the deathrates corrected to the general male population are 
significant, in the next case—for example diabetes—they may not be, while the 
differences of the deathrates corrected to the standard population of maximum 
difference may be—as in the case of diabetes practically, they are—of significance. 

* The “corrected deathrates”’ are here reduced to the assumed male population of 1913 of all England 
and Wales. They differ therefore somewhat from the Registrar-General’s “corrected deathrates”—110-0, 


98-9, 90-2, 80-6 respectively. which are deduced from the general population of England and Wales, 1901, 
+ Q may be deduced from an unreal population of maximum difference, i.e. one with some age classes 


negative. 
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If we consider the order of the significance deduced from the two standard 
populations we have: 


Q | Pair (M’- Money | 
eee = me 2||ceae a | 

13-09 / London and Rural Districts ... re 11-53 | 
12-25 | County Boroughs and Rural Districts 9-61 | 
8-95 | London and Urban Districts ... ues 7:74 
7:39 | County Boroughs and Urban Districts 4-89 | 
6-96 London and County Boroughs oe 3:93 | 
6-76 Urban Districts and Rural Districts 5-32 | 


The Q order shows that London and the County Boroughs are markedly 
differentiated from the Rural and Urban Districts—in the higher degree from the 
former. Thus the city districts tend most to a high cancer rate. London is 
differentiated from the County Boroughs, and the Urban from the Rural Districts, 
but to a less extent than in the previous cases. The (MZ’ — M)/oy-_ 3, test confirms 
this order except in the case of Urban and Rural Districts which are now more 
highly differentiated than County Boroughs and Urban Districts. Thus we note 
that a particular standard population may not only influence the significances of 
the differentiated deathrates, but also the order of these significances. This result 
will be confirmed in the case of diabetes. 


The following are the values of Q? and xp": 


Pair | Q? | Xo" | 

RE ee ae na 2 See ee 

| | 

London and County Boroughs | 48-4567 | 49-2414 | 
London and Urban Districts ... .. | 80-0869 | 80-6986 
London and Rural Districts ... es 171-3498 164-2686 
County Boroughs and Urban Districts 54-5901 54-3279 
County Boroughs and Rural Districts | 149-9172 | 151-0230 


Urban Districts and Rural Districts... | 45-6922 46-5109 


It will be seen that the only considerable difference between Q? and y,? arises 
in the case of the London and Rural Districts pair, and more than half this 


difference is due to the very heavy deathrate in London from cancer of persons 
between 65 and 75. 


Before we discuss the inferences to be drawn from these results, we will place 


l 
on record the ratio of m to its standard deviation Vou The easiest way to 


calculate this ratio is from the components 


Biometrika x1 12 
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We have m= yt TES, a, 
au” Vv as aa. 
where Y, must be given the sign of ab — aH The advantage of this method is 
s s 


that the value of Q, has usually been tabled as a stage to the finding of 
Q? = 8,“ (Q,?). In the present case we deduce: 


Arithmetic Value of mm Ta 


2Qu 
London and County Boroughs __... a abc ad be pe 1-00 
London and Urban Districts ike see sth sae Bek ake 1:94 
London and Rural Districts ae nee nie be Beis Bee 1:94. 
County Boroughs and Urban Districts... Bue nee 600 cee 0-06 
County Boroughs and Rural Districts... “on Age 606 Bod 0-49 
Urban Districts and Rural Districts Si ma obs 2h ane 0-26 


We can now calculate the probability that each pair are random samples of 
the same population by aid of our four tests. We place the pairs of districts in 
order of their improbability as random samples of the same population, taking 
Xo? as our standard test. 


Probability P of the Districts being Samples of the same Population. 


ea f Test from Test from 
Paired Districts compared G Xor Test, or Corrected |  aeaaeaee e Deviations 
for Cancer Mortality oodness of Fit Deathrates | Faye 
Test (M’—Myfonr, | Deviations a n/ = ) 
MM) (3Q?-u)//u /2u 
London and Rural | | 
Districts... 9625/1089 1-8031/10°9 |: 1-9628/10144 3121 
County Boroughs andl | 
Rural Districts... -5414/10?" -7929/1084 =| 1-9385/10197 0262 
London and Urban 
Districts —... 1-1740/10!8 1-7497/10% — | -2555/10?8 0262 
County Boroughs and | 
Urban Districts... 1-6355/108 -6606/108 | -5133/10° 4761 
London and County 
Boroughs _... ... |  1:4996/107 1-6586/1012 -1927/108 -1587 
Urban Districts and | 
Rural Districts | 4214/10? 6327/1011 | +1962/105 +3974 
| 


The probability* in the first of these tests is deduced from the Goodness of 
Fit Tables; in the remaining three tests from the Tables of the Probability 
Integral. It will be seen that the first three tests give absolutely the same order 
of significance for the six pairs of differences. The fourth test is irregular and 


* The very high improbabilities given are only rough approximations, sufficient, however, for our 
present purposes. Since we enter the Goodness of Fit Tables with n=ten (i.e. w+1), we must use the 
first value of P in Equation (xxix), Vables for Statisticians, p. xxxi. The integral J may then be 
calculated by the first term of the Schlémilch formula (ibid. p. xxxiii), as this integral will only affect 
the fourth figure in the decimals. Table IV (bid. p. 11) has been used to approximate to the extreme 
tails of the probability integral. Very useful work could be done by extending this Table between 5 
and 50 to the first decimal place in the argument. 


jee. 
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confirms the view already expressed that but little is to be gained from its use. 
The fact is that each test measures a different feature of the difference of the 
distributions and the worst test will be that which measures the least important 
characteristic. There is little doubt that the deviation from zero of the mean of 
all the deviations measured in terms of their §.D.’s is this characteristic. The 
second test measures the significance of the “corrected” deathrates for the standard 
population of maximum significance gives results which are most closely in accord 
with the x,2, but it suffers from two rather serious defects: (i) it is conceivable 
that M might be very close to M’ and yet the actual distribution of deaths very 
different, (ii) the standard population which gives the maximum significance to 
the difference of the corrected deathrates may, as we have indicated (p. 164), be 
an impossible one. Hence the values of the significance may be very considerably 
exaggerated. It has been our experience, that when we have taken other standard 
populations, we have found the @? considerably less than for the population of 
maximum significance, but not always most markedly less. 


On the whole we think a test which considers the general distribution of 
deviations more likely to show definite results, than one which considers only 
a mean, and from this standpoint we hold that the x9? and (4Q? — u)/Vu are the 
better criteria. The main assumptions on which these tests are based are for the 
latter: that the distribution of squared deviations (each measured in terms of 
its own 8.D.) will, even if each deviation be selected from a non-normal frequency, 
give a second moment distribution which follows the normal law; and for the 
former: that the Gaussian curve accurately enough describes the frequency given 
by a binomial. This assumption would be more closely fulfilled by a general 
than by a special disease deathrate, but is probably more valid than the previous 
assumption. Hence we believe that while the three first tests have all a certain 
value, the first and third are to be preferred and the first is best of all. 


Judged by the first, second and third tests we conclude that significant 
differences can be definitely said to exist between all these cancer mortalities, 
Urban and Rural Districts showing the least but still a very weighty significance ; 
that London and the County Boroughs other than London have significantly 
different cancer mortality, while both London and the County Boroughs differ 
conspicuously from the Urban and Rural Districts. 


It is clear that the degree of significance is closely associated with some variate 
which increases with difference of position in the scale (a) London, (b) County 
Boroughs, (c) Urban Districts, (d) Rural Districts, i.e. with some factor which 
increases with the city character. The increase during the past fifty years in the 
cancer deathrate has been associated by some with improved diagnosis. Is the 
variate correlated with the above order that of better diagnosis? It may, perhaps, 
be doubted whether the general practitioner is much more competent in cancer 
diagnosis in London now-a-days than in the Rural Districts. But an examination 
of the terms of (? or x,” shows that nearly half of the significant difference arises 
from the terms Q,? and y,? corresponding to the 45 to 55 group and nearly a third 
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corresponds to the 55 to 65 group. . In fact in the case of London and Rural 
Districts about three-fourths and in the case of County Boroughs and Rural 
Districts about four-fifths of the value of Q? are contributed by the age groups 
45 to 65. This is not the period in which the total number of deaths from cancer 
is a maximum; it may therefore be the period in which the less skilled medical 
man is less likely to diagnose cancer. It would be equally valid, however, to 
assert that cancer finds more susceptibility in town than in country dwellers and 
that this is particularly the case with men from 45 to 65 years of age. Further 
suggestions are, of course, emigration of cancerous persons to the towns* and 
the presence of certain occupations with high cancer deathrates in the towns. 
In both these cases we must find explanation for the particularly marked contribu- 
tions for the age groups 45-65, unless we are content with the view that these are 
the age groups where the cancer deathrates are fairly high, and the population of 
the group with which the result is weighted fairly large. 

It would be in every way desirable if we could, applying ‘“‘Occam’s razor,” 
attribute to one source the rising cancer deathrate and the significant differences 
between cancer mortality in cities and in rural districts. Both may be due to 
differential diagnostic power or to varying accuracy of certification, but we gain 
little by merely throwing out suggestions, and omitting to demonstrate them. 


(ix)—(xiv) Diabetes Deathrates for all England and Wales. 


As a last illustration of the present method we take the deaths from Diabetes 
for the year 1913 from the Registrar-General’s Report} in the same fundamental 
groupings; of course the populations at risk in age groups will remain the same. 


Diabetes Deaths in Age Groups, 1913. 


‘a | | (b) Count: c) Urban (d) Rural | 
Age Group \. he Londons Boreuctiet Opisteiets Districts 
0—15 | 8 22 22 26 
15—25 | 12 35 46 38 
25—35 | 20 60 69 38 
35—45 | 21 63 66 44 
45—55 49 86 102 51 
09 —65 | 59 158 196 106 
65—75 47 178 181 137 
7d—85 | 16 47 57 50 
$5 and over | 0 5 6 i 
Tota] Deaths | 232 654 745 491 
sk |aauee Rees We ee 
Crude deathrates | 10-930 11-700 12-051 12-392 
Corrected deathrates | 11-099 12-649 12-433 10-948 
(Both per 100,000) | 


* It must be remembered of course that institutional deaths are since 1911 distributed to their locus 
of origin, and therefore immigration for operation no longer tends to swell the London or County 
Boroughs cancer deathrates. It would be otherwise with immigration for permanent residence in the 
initial stages of the disease. But there is no evidence at present to show that cancerous persons of 
ages 45 to 65 do move into the larger towns. + See pp. 217, 235, 253 and 271. 
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The “Corrected” Deathrates in the table on p. 180 are based on the Age Groups 
of the total population for England and Wales in 1913. It will be seen that they 
modify entirely the order of the Crude Deathrates, which order appeared at first 
sight to be comparable with the cancer results, as showing a difference between the 
city and rural or urban areas—in this case indeed to the advantage of the former. 
Proceeding as in the case of cancer to compare significance of corrected deathrates 
when reduced to the general population of males (1913) as standard and when 
reduced to the standard population of maximum difference we find: 


| 


l 
Pair MW -M | oyp_y | (M’-Mycy_y Q 
ss : Pee ae eae 
London and County Boroughs “ts — 1-550 8898 1-74 4-16 
London and Urban Districts ... .. | — 1334 “8794 | 1-52 3°16 
London and Rural Districts... ais + 0-151 -8829 | 0-17 3-95 
County Boroughs and Urban Districts | + 0-216 6727 | 0-32 2-34 
County Boroughs and Rural Districts + 1-701 -6998 2-43 5-76 
Urban Districts and Rural Districts ... + 1-485 -6790 | 2-19 5:03 
| 


Now with the single doubtful result for County Boroughs and Rural Districts 
none of the values of the ratio (M’ — M)/oy_y can be definitely considered as 
rendering M’ — M significant. On the other hand all the differences of the corrected 
deathrates reduced to the standard population of maximum difference must, with 
‘one possible exception, i.e. County Boroughs and Urban Districts, be considered 
as markedly significant. This illustration is of great interest. For it is quite 
easy to select a standard population where for real age classes there is a significant 
difference between the “corrected” deathrates of London and the County Boroughs, 
but reduced to the general population (males) for 1913 there is no such difference. 
We see therefore that the reduction to an arbitrary standard population may be 
absolutely misleading as a means of testing whether two class deathrates are 
differentiated. Again the order of pairs for significance in the case of diabetes 
is for Q: 


Q Pair (M’ = M)/oyg-—ag 
5-49 County Boroughs and Rural Districts ... 2-43 
5:03 Urban Districts and Rural Districts 2-19 
4:16 London and County Boroughs ... 1-74 
3°95 London and Rural Districts 0-17 
3-16 London and Urban Districts... ses 1-52 
2:34 County Boroughs and Urban Districts 0-32 


It will be seen that the significance of London and Rural Districts is considerably 
displaced by the general population as standard. Or, we conclude, as in the case 
of cancer, that relative significance as well as absolute is determined by the special 
standard population selected and we cannot accept the current view that the 
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“standard population” to which the deathrates are corrected is a matter merely 
of convenience. 


It is clear that there is nothing in the “corrected”’ deathrates to indicate in any 
group a substantial difference from the rate 11-883 per 100,000 for all England 
and Wales*. It will accordingly be of some interest in this case to test whether 
this result of non-significance which flows from the deathrates corrected to the 
general population is confirmed when more accurate methods are applied to test 


the distribution of deviations as well as mean rates. The following are the values 
of Q? and x,7: 


Pair Q? X02 
| London and County Boroughs ... 17-3188 18-0702 
London and Urban Districts 10-0126 9-9831 
London and Rural Districts an 15-5939 15-4501 
County Boroughs and Urban Districts 5-4693 5:4236 
County Boroughs and Rural Districts 30-1470 30-4870 
| Urban Districts and Rural Districts 25-2912 25-8263 


It will be seen as in previous cases that there is no substantial difference between 
Q* and: 1757. 


The following table gives the deathrates from Diabetes per 100,000 exposed to 
risk in each age group of the four categories: 


Male Diabetes Statistics, England and Wales, 1913. 


Age | b) Count (c) Urban d) Rural 
Group ; (2) London Rome Districts Costa 

| — = - 

| O15 1-23 1-22 Ll 2-09 
15—25 3-13 3:48 4-08 5:30 
25—85 5-51 6-39 6:78 6-55 
35—46 7:16 8:22 7:89 8:87 
45—55 22-94 16-02 17:38 12-90 
55—65 44-89 47-70 53-00 38-29 
65—75 68-99 106-77 90-06 76:10 
7d5—85 | 84:97 108-99 94-36 76:39 

| 85 and over 0 112-28 81-51 10-96 


The problem before us is again whether the above distributions of deathrates 
as a whole are or are not significantly different, and not whether the differences 
in one of their statistical constants (i.e. the “corrected”? deathrate) are or are 
not significant. We shall apply only three tests, i.e. (i) the xy? test or Goodness 
of Fit Test, (ii) that of the difference of corrected deathrates for the standard 


* The deviations from the General Rate are (a) —-784, (b) +-766, (c) +:550 and (d) —-935, all 
of the order of the probable errors of the differences. 
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population of maximum difference, and (iii) the Test from Distribution of Squares 
of Deviations. 


Proceeding as in the case of Cancer we have the following table: 


Probability P of the two Growps being samples of the same Population. 


Gee aoe | (iii) Test from 
Paired Districts compared AW je ee Corrected gs aaa of 
for Diabetes Mortality oe. eat oii | Deathrates aaa 
est M_-M : Deviations _ 
( Vow—m| io, 
gu | a@-wivu 
| 
So ——. SS 
Rural Districts and 
County Boroughs 0004 20/10? 0215 
Rural Districts and 
Urban Districts ... 0022 *25/108 | 1121 
London and County 
Boroughs ... aa 0344 “000,02 4548 
Rural Districts and 
London _... ras -0801 000,04 3442 
London and Urban 
Districts ... ae 3520 -000,7 -0916 
Urban Districts and 
County Boroughs 7943 -009,7 0184 


As before the order of significance, as we might expect, of the first two tests 
is the same*, but it is no longer as in the case of cancer the same as in the third 
test. The third test shows only significance between the members of the two pairs 
Rural Districts and County Boroughs and Urban Districts and County Boroughs, 
and neither difference is at all emphatic, while the last is out of accord with the y,? 
test. The y,? test shows distinct significance between Rural Districts on the one 
side and County Boroughs and Urban Districts on the other. The Rural Districts 
are not very significantly differentiated from London. Probably London and the 
County Boroughs have a different diabetes mortality. An examination of the table 
on p. 182 shows that the chief difference between the Rural Districts and the Urban 
and County Boroughs mortality is the much lessened deathrate after 50 years of 
age; while the difference between the Rural Districts and London lies in the 
much greater deathrate from diabetes under 45 in the Rural Districts. Such 
differences might be so balanced that there existed no significant difference in the 
corrected deathrates. No one could judge by the corrected deathrates of 10-948 
and 11-099 with a probable error of -60 that the Diabetes mortalities of Rural 
Districts and of London are as significantly differentiated as they are. 


We see that the second test enormously exaggerates the significances determined 


by the first test and this is precisely what we might anticipate. The second test 


* The order of significance in the third test will not now be the same as in the second, although 
it depends only on Q? because ($Q? — wu) can be positive or negative. 
because Q? was always greater than 2u. 


It was the same in the case of cancer 
Positive or negative values of (4Q?—w) only signify that the 
mean of the squares or deviations exceed or fall short of the theoretical value respectively. 
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is the difference between the corrected deathrates, corrected to a standard popula- 
tion which gives the maximum difference between those rates. But this standard 
population is, if the individual age group deathrates are not all greater in one 
district than the other, an algebraical fiction and not a real standard population. 
But with the single possible exception of London and County Boroughs there is 
no approach in the case of diabetes mortality to greater deathrates in all the 
age groups of one district class. This was far more nearly the case in the cancer 
mortality. Hence the second test was more reasonable in that case. 


We have retained this maximum difference of corrected deathrates to the end 
of our illustrations, because we think it serves to indicate the danger of any 
argument as to significant differences in mortality based on “corrected deathrates.”’ 
In the case of diabetes, our district classes give no such significant differences. But 
our x,” test shows that these differences actually exist, as indeed might be suspected, 
although their numerical valency could not be adequately tested by a mere 
examination of the age group deathrate table on p. 182. The reason for this 
failure of the corrected deathrate difference lies largely in the fact already insisted 
on that the significance of the difference in the corrected deathrates depends largely 
upon the standard population selected—a point often overlooked. What then is 
to be the standard population selected? Clearly it should be such as (i) to make 
the corrected deathrate difference a maximum and (ii) at the same time be a real 
population, i.e. not one with negative age classes. At present we do not see how 
to reach the maximum of a certain function of variates, subject to the condition 
that the variates are to take positive values only. The corrected deathrates for 
diabetes in certain districts show no significant differentiation when reduced to the 
general population of England and Wales as standard. They show a marked 
differentiation when reduced to the standard populations of maximum difference. 
It is true that these populations are merely algebraic fictions, but how far should 
we approach this marked differentiation, if we could discover the real maximum 
difference population? We cannot say; and in view of this uncertainty, it seems 
to us needful to drop for the present any criterion of mortality differentiation 
depending on the so-called corrected deathrates. 


We can only conclude that the proper test for differentiated mortality is the 
Xo” test used for the first time in the present paper. For this test does not depend 
on the measure of the divergence between two means—“ corrected” it may be,— 
but on the general difference between two frequency distributions as wholes 
and this appears to us the essential feature of any true measure of differential 
mortality. 


We have to thank our colleagues Mr A. W. Young, Mr I. Horwitz and 
Mr George Rae for much assistance in a piece of arithmetical work more arduous 
than may appear on the face of this paper. 


ON CERTAIN PROBABLE ERRORS AND CORRELATION 
COEFFICIENTS OF MULTIPLE FREQUENCY DISTRI- 
BUTIONS WITH SKEW REGRESSION. 


By L. ISSERLIS, D.Sc. 


(1) In the systematic investigation of the statistical constants of multiple 
correlation and of their probable errors, it is important to have to hand the probable 
errors and the mutual correlations of the more fundamental constants—the means, 
the standard deviations and the correlation coefficients. For the case in which 
the frequency distribution follows the normal law this need is supplied in the 
memoir by Pearson and Filon entitled “On the Probable Errors of Frequency 
Constants and on the Influence of Random Selection on Variation and Correlation®.”’ 

As regards the more general case in which the regression is skew, the probable 
error of a correlation coefficient was first given by Sheppard (Phil. Trans. Vol. 192, 
A, p. 128). 

The probable error of a mean and the correlation between deviations in the 
value of the mean and that of a standard deviation, or of a correlation coefficient, 
and the correlation between two standard deviations, are given by Pearson 
(Biometrika, Vol. 1x. 1913, pp. 1-10). 

For reference we give here the results for the case of normal distributions 
obtained by Pearson and Filon in the memoir referred to above: 


2S o,/V2n BP ns afer eral Te ela Gaye Cee See Oe eee wae o's ines OS (1), 
Yo US Fs) EL cosocie ere roca eee ee ee (2), 
LNG Wat omer ce mare eer canis Ant lacntises ae nea vie hack Svein vealsuna ch wawiales slow (3), 
Re rs == Pap V2 Se eadedeat Ob BU ENE BONA “6 IaNe Sere te CO ea a (4), 
M12 (13 — Tre%eg) + 113 (M12 — 113793) 
Re tres = V2 zy = ay. a BER BOCOROSHE RE OED Meee eee (5), 
ok Nis tiaieait sa aig = felt 2712113123) 
Ry ata = lo3 a -) (1 = 715) (1 a 743) as Boy Uiieisioteleleieicierelalove*stereieys (6), 


| (713 — T1273) (Toa — 723 %3a) + (14 — 134713) (123 — 112713) | 
R _ A+ ("13 = 7147s) (Taq es Tyo 14) AAU ee T2124) (Tog — To41ga) (7) + 
rata 2 (1 = 7%y2) (1 — 9) 7 


* Phil. Trans. Vol. 191, A (1898), pp. 229-311. 
t Phil. Trans. Vol. 191, A, Equations (xv)-(xviii), (xxxvi), (xxxvii) and (x1). 
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In the present paper the corresponding results are obtained for the case of skew 
regression. The method employed is different and by supposing the regression 
to be linear and the distribution to be normal, a confirmation is obtained of the 
above results which in Pearson and Filon’s memoir depend on very complicated 
analysis. 


(2) We may begin by discussing the correlation that exists between deviations 
from their means in the case of two correlation coefficients 7,, and 7,;. 


We have Piri = Daryl A/MPar an). «siscee selena tect eee (8), 


Tak =" Del) (Dez Did oce- kiares doeteects «oe ee Eee (QO) 


where pyr ymznge 18 employed to denote the mixed moment of orders 1, m, m, k in 
the variables, taken about the means so that - 


dy dps, \ dpa al dp: 
Voy Pay 2 Poe 2 Py 
and it: MiP Ape Lane re (11). 
Vee Pu 2 Px 2 pe 
It is clear that we shall require the correlations between any one of Py,, Py2, Py 
and any one of p,,, pz and pp. It will suffice to find the correlation between 


Puy and 4. 


Now Nyy = SS Aig, (Ci 2) Yi — Yn. cae sae ee Ree (12), 
xy 
Ndpyy = SS (dry, (& — Z) (y — 9} 
"+ 8S{—ney (y — 9) dz} + SIS{— ny (a — Bey a 
xy Ly 
or Nap, 9S 'S (ding, XY eect hace see eee (14), 
“cy 


if we denote the total population by N, x —% by X, y—y-by Y and remember 
that 
x y 


Similarly. Ndpa = 8 Sidn,p ZT) 10600 tae ee (15). 
Zt 


The mean value of dn,,dn,, in many samples is the mean value of 


ei (On aes x 4 S {dnz, y,, Pik 
1 
eae SSS Ld yy znty Mar yy arts 33 Maesys 2 ty (irs Me yy etl N )-.-(18), 


where in the fourfold summation the term 


(dn 


\2 
XeY yi ly 
is omitted. 


But clearly the right-hand member of (16) reduces to 


Ney zty -, Nyy Naty|N, 
hence the mean value of 


N*dpsy@Pze ISN (Daye Deg Dt) axes cere ee (1): 


iw 
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Putting t = z we deduce from (17) 


Mean value of dp,,dp,2 = (Poy: — Pay P2)|N scrcevceveevens (18). 
and putting y = x in this result, 
Mean value of dp: dpj2 = (pa. — Par Put)|N  veseeeeeeeneeee (19). 


If we multiply (10) and (11), sum for all samples and divide by the number 
of samples we deduce 


No, , ee Ry yr Tau" at 
__ Pavat — PovPet 1 (Poust — PowPee) _ 1 (Pave — PavPe) 
Puy Pzt 2 PayP2 2 Poy Pe 
lL (Datet — Par Pat) 1 (Pyrat — Pur Dat) 4 1 (paee2 — Pa? Pz?) 
2 Paz Pet 2 PyrDet 4 Py Prz2 
i 1 (Pare — PoPr) 1 (Py2z2 = Pyz Pz) ae i (Pye = PyPe) (20) 
T 5 é 4 i hres: a . 
4 Pw Pe 4 Py Pa PyPe 


This result like Sheppard’s formula for o°, is much simpler when expressed 
in reduced moments. Let us write 
Palymart 
Groynata, 


ie a Jechy 2h th » 


so that q,: 1s unity and q,, =7,,. The numerical term in (20) is 


—14+4(4)+4(-4) 
or zero, hence 


Noa 


Cal Voy? 
Tay V2t rep taal eye zt 


OPK: 1 (ey22 + Veye Ae Ones 1 
= feat _ 5 (fom Moves | detet Mite) 5 gases + date + dee + Gee) 


Pay! 2t a Vey Txt 


masents (21). 

In the same notation Sheppard’s formula becomes 
ieee (Qarv . 4 taaveate qaty + fey (99) 
Py Inf gaa + 4(Bs + B's) + ary Pe (ge as (22)*. 


To find the correlation between r,, and 7, we have only to replace ¢ by x in 
(21), thus 


No;., ore Ry, Tae "xy x2 
Ya2 yz 1 Jaye + Ja8 Yaiz + Qay2z I ' \ 
= aoe mo (‘ io wage ae fay a a (Gatee + Gat + Ya242 + Yy2z2) 


Voy lez 


(3) These correlation coefficients will simplify if the regression be linear and 
simplify to a considerable extent if at the same time the distribution be normal. 
For with linear regression 


ND ye =SSS (May, t?Y2) T 
xy z 
=O SNe Say) 
ry 
where Z,, is the mean value of z for given values of z and y. 


* For the denominator of left-hand side, cf. Biometrika, Vol. 1x. p. 4. 
+ The origin being taken at the mean. 
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But from the usual regression equation 


> Tez — Vaey"yz\ Fz lyz —Ve2zVey\ Fz 
“gy a" _ 9 a IF Yy ’ 


2 22 
LT On Dare 


so that for linear regression 


Vag — Voylyz\ Fz Lyz Vel ey\ Fz ¢ 
Patyz = ( To 2 x ) é Deke A (= Pe) ") 3, Pry® vevccverenes (24). 
ey @ xy y 
Further Da, = x SS (Npyx?y) 
cy 
ee S204 
1 
= we (nea 7 7) 3 
or Pedy = Pa Tay = wisisle)sicleleiarseielplelefelsierejoielsfslefelelehoeietefefoteteteletets (25), 
while Guaye = 1 + 77,,V (8s = 1) (6'5 — 1) approximately a... (26)*, 


so that (23) can be evaluated approximately by the use of simple moment 
coefficients and correlation coefficients only. Jf in addition the distribution be 
normal, we know that 

Dasy = 3PeyPe and py p= (1+ 272,,)/pepy, 
so that for normal distributions 


(np ("= — “a s) 3 e Yy2 — Vostay L + 2 oy 
6 +2 , . 
Pry Pez Icey 2y lez Ws te Payl ez 
ER a ie Ys 27 
or = am ee Taj] Vasa Vice ee Ta aes oa er OEE (27). 
LY" LZ 
plies 2 Qe tee 
Similarly Love Ea, gre EET ee Ea SU re BER ab colic. (28), 
Loy ley 
d foots 1 + Qyetaylt (29) 
an mee Qi at V rau | Taig oe oaisenne s4onelace ne noes eee ; 
UZ 


Substituting these values in (23) we obtain, after some reduction and using 


= (1 — 72, (VN, 


(1 — 972) (1 — 1 ee) & 


Tay az 
2 2 2 ily, , 2 a2, 2 
=%yz (1 ae tu a) (1 ali 3) — 27 aylaz (1 PP gy — Toyz — Tze i ate tee) 


agreeing with (6) the value obtained by Pearson and Filon for normal distributions. 


As regards the more general case dealing with the correlation between deviations 
of 7,, and those of r,, given by equation (22), we have when the regression vs linear 


Proyzt = SS S S {Neyzt (wyzt)} 
CUES at 
= SSS {ney i cyctaues 
ip DD) 2 


* Biometrika, Vol. 1x. p. 4. 
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where ¢,,, is the mean value of ¢ for given values of x, y and z, so that as is well 
known 


i Oo; Ave Ot ae Ot A: B31) 
rae at ay Ee ETE ac seecsennene (31), 
Oy Net Oy Ate Oz Att 
where = te Teys Vozs Vat 


loz» Vyz ? i Vet | 
lots Pyt» Vxts i 


z 


and A,, is the minor corresponding to 7,,. Thus 


Vayzt a (Ane Gaze ar Ny eVasts + Net Qave2)| Ate Cy (33), 


r, can be evaluated approximately in the case of linear regression 
7 S 


so that R, 

xy 
without employing any mixed moments beyond the simple product moment 
occurring in a correlation coefficient. 


For normal distributions we may use (27), 28) and (29) giving 
Vouzt = — [Az (27a lag ar Ty) at Av (Zou laa =i frese) AE Au (2% a2T yz aE Tay) Ase ...(34). 


By well-known properties of first minors of a determinant we have from (32) 


TA + Vay Aye + Voz Noe + T xt Aus = 0 ee (35), 
Voy Aa 4- Ayt + Tyz Asi + Put MG = 0 Ce (36), 
Vag Nag +- Vyz Aut + Avi + TTAWT = 0 eee eee e eres eee rcccsnncs (37). 


Multiply these equations by 7,,, 7, Try respectively and add, 
(Gz an Mel ay) a ais Ges + Diigo aa) Ay 
ap (Tay SF 21 y2% x2) Aut aE (TueTat ae Veal yt ae Toy? at) An = 0...... (38). 
Combining this result with (33) we see that for normal distributions 
Cty ei Vy Tae yp eT el tee cain aleliornincisaae eBePs (39)*, 
an interesting result likely to prove useful in other applications and probably 
capable of generalisation Particular cases of (39) are obtained by putting ¢ = z 
so that qy2y. = Tyz + 2eyxz Which is (27) andt=2,z=y giving q,2,: = 1 + 2r?,, 
which is well known. 
If we now substitute these values in equation (21) we find 


No 


Tay et K,., Vet 
a, WP ye V ot ns {Pal yt a Waelye lat aa 27 £0 yt at 
— WVoeel atl oy — Maeliat tary + Vey l zt Rone She ioe oF a Ae 74). 
The right-hand member can be put in the form 
2 {Tat = Taatat) (Tye Ey Tay Toe) le (Txt a Toy yt) (Ty. pa Ty? at) 
ar (Ze oa Tau yz) (Tyt bas Tatas) ste Give a Tet ys) (Tye ae Tou lae)t> 


* This result, which is accurate for normal distributions, is given as approximately true for such 
distributions by H. E. Soper, Biometrika, Vol. 1x. p. 100 
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and if we remember that for normal distributions 
Gy =(1L—PaylVN,  o7, = (1—12)/VN, 


this result agrees with Pearson and Filon’s value quoted above as equation (7) 


To find the probable error of a standard deviation. 


(4) 
Gur = Px» 
do, apy 
On — 2042 Cr (40). 
x o Pai — pers 
Hence a dpa by (17) 
acne 
4N ’ 
eee VES 0s Sas (41) 
i 2VN 


This result is well known, and for normal distributions, i.e. when 8, = 3, becomes 


agreeing with (1). 


2 Ge 
— /2QN’ 


To find the correlation between a standard deviation o, and a correlation 


Cy 


coefficient r,,, we multiply (40) by the equation 
Or yz ef dDy: dD dp 
Vyz Pyz {py 2p.2 
and sum for all samples and divide by their number in the usual way, obtaining 


2Nog 07, Re ry,| Fa ve 
(pe 22a Px2Py?)|Pe2Py Da 4 (Dy222 =e Daz P22) a2 Px? 


= (Dass ed Px Pyz)| Pa2 P vz 3 

= Gave Wan ee Fy ee ee 
Vig » a 

a result which as before can be approximated to in the case of linear regression, and 


which for normal distributions becomes* 
Pye One Nee 
= ay BEE os : (2 | De ay =r Zr eres) 


7, Vyz 
x9" y2 ¥ IN Ox (1 a lp) 1 
“VJIN' VN Gab ye 
27 ay Tae Ee + 1 2) Pye (43) 
BSSOOOOOOOOOCIOOr OOOO on 9 


Sey 2a) 


. . LY = 
agreeing with equation (5). 
* For the case z=, ie. R , cf. Biometrika, Vol. 1x. p. 8. 
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ON THE CORRELATION BETWEEN THE “CORRECTED ” 
CANCER AND DIABETES DEATHRATES. 


By C. A. CLAREMONT, B.Sc., Biometric Laboratory, University College. 
i} 


In a paper due to Maynard published in this journal* it is shown that in the 
United States in the case of both states and great towns there is a marked correlation 
between the corrected deathrates from cancer and diabetes. The point is of very 
great interest, and additional investigations by Pearson and others+ and by 
Greenwood and Frances Woodt confirm Maynard’s result for the material used 
by him. On the other hand Greenwood and Wood show that it is not apparently 
true for Switzerland. 

Maynard writes§: “If the increased rates observed in the cases of cancer 
and diabetes were due to a common cause then it is probable that their rates of 
growth will be found to be fairly highly correlated. Not being able to obtain 
rates for a sufficient period of time from the United States reports, the rates given 
in the Registrar’s Report for England and Wales, in five-yearly groups for the 
35 years 1871 to 1905, were used and here p = -8060 + -0893. This high correlation 
shows I think a strong probability that there is a common factor influencing the 
increase of both diseases. That this value is explicable on the assumption that 
the increased rates are merely apparent and due to more careful diagnosis is in 
view of the facts already mentioned almost inconceivable.” 


In view of the development of the variate difference correlation method since 
the publication of Maynard’s memoir, it seemed possible to test this point, and 
at the suggestion of Professor Pearson I undertook the necessary calculations]. 
It is to be noted that it is partly the continuous rise in the deathrates from these 
two diseases during the last few decades that has suggested a possibly organic 
relation between them. This increase is effectively shown in the Diagrams of the 
corrected deathrates I and II for these diseases, and in the combined “spot” 
diagram for individual years, III, which brings out the contemporary rises. 


* Biometrika, Vol. vu. pp. 276-304. 

+ Journal of R. Statistical Society, Vol. 73, p. 534. 

t Journal of Hygiene, Vol. x1v. p. 83, 1914. § loc. cit. p. 289. 

|| I have heartily to thank Miss B. C. B. Cave for revision of my arithmetic and the correction of 
several errors. 
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Dracram III. Cancer and Diabetes. Corrected Male Deathrates showing scatter of 
pairs of occurrences in the same years. 
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The diagram indicates an apparent highly correlated change. 


It should be noted, however, that Newsholme and King* conclude that the 
increase of ecancer is apparent only and due to improvement in diagnosis and 
more careful certification of causes of death. Whether there has been a real 
increase in diabetes is a question that these authors consider to be undecided. 
It is difficult, however, in the case of England and Wales to believe that the 
increase in the corrected deathrates for both cancer and diabetes that has gone 
on continuously since 1900 can have anything to do with increased accuracy 


* R. S. Proc. Vol. trv. p. 228 (1893). An examination of our Diagram I shows that the increase of 
the corrected cancer deathrate from 1873 to 1893 was practically the same as from 1893 to 1913. If 
improved diagnosis is the source of the increased cancer rate it is certainly remarkable that the 
improvement should have been so uniform for the space of forty years. 
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of diagnosis of either disease in this recent period, and the general sweep of the 
plotted curves seems to show that the source of the increase whatever its nature 
has been continuous since the middle of last century. 


In the following investigation I have correlated the deaths from cancer and 
diabetes for the male population only. I had intended to deal also with the female 
population, but the results for the male were so conclusive, that I did not think 
it needful to go further. But the idea of dealing with the female deathrates led 
me to select as standard population the total male and female population of 
England and Wales as given by the 1901 census. Of course the choice of the 


Diacram V. Associated Inflexion Points in Corrective Factors and °/, Males under 45. 
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Diagram illustrating that the inflexion point in the corrective factor diagram (IV) corresponds 
to actual changes in the age distribution of the population. 


standard population to obtain a corrected deathrate is arbitrary, but if female 
deathrates are to be compared with male, we must take a common standard for 
both, and this is why the combined male and female age groups of 1901 were taken 
as the standard population classes. 


A word must be said as to the manner of correcting the crude deathrate for 
age groups. Actually we only know the age groups accurately for the census 
years; the age groups for intervening years are more or less guesses not beyond 
suspicion. Accordingly it was considered best to obtain the corrective factors 
for the crude deathrates in each census year, then to plot these to the year, draw 
a continuous curve through the plotted points by aid of a spline and read off from 
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this curve the corrective factors for non-census years. This was done on a large 
diagram of which IV is a small reproduction. 


A noteworthy feature of these corrective factor curves is the inflexion which 
occurs between 1861 and 1871. There are two tests of its reality. First the 
diabetes curve confirms the cancer curve and secondly as far as the data permit 
an approximative factor was obtained for the cancer corrective factor curve for 
the census of 1851; it is seen to substantiate the general sweep of the corrective 
factor curve and confirms the inflexion between 1861 and 1871. Diagram V 
shows that the inflexion also occurs in the age distribution of the population as 
exhibited in parentage of males under 45. 


In dealing with the years 1875 to 1880, the few deaths separately recorded 
under Melanosis, Fungus Haematodes and Sweep’s Cancer—now entitled simply 
cancer—were included under the total deaths from cancer. 


The following table showing the work for the year 1911 indicates the manner 
in which the census year corrective factors were found. The crude deathrates 
per million of the male population in this year were: Diabetes 110-96 and 
Cancer 893-57. 


TABLE I. Corrective Factors for Census Year 1911. 


Dene Corrected Deathrates 
Cc Dia- Census Deathrate | sate per Standard | _ 
Age eis betes | Population per million, | Population, | 

Groups Mal. S| Deaths (Males), million, | Diabetes| 1901, per Cancer | Diabetes 
(Males) | (Males) 1911 Cancer=r,| _, | million A 7A _ 1A 
| ; = 108 = 108 

| ay 

Under 5 64. ll 1,936,113 | 33:1 5:7 114,262 3°78 -65 
5—10 51 22 1,847,295 27-6 11-9 107,209 | 2-96 1-27 
10—15 39 36 | 1,747,631 22-3 20-6 102,735 | 2-29 2-12 
15—20 60 67 1,654,895 36:3 40-5 99,796 3°62 4-04. 
20—25 95 71 1,502,652 63-2 47-3 95,946 6:06 4-54 
25—35 317 157 2,831,655 111-9 55:4 161,580 | 18-08 8-95 
35—45 978 183 2,336,508 | 418-6 78:3 122,848 | 51-42 9-62 
45—55 2901 252 1,694,333 1712-2 148-7 89,222 152-77 13-27 
55—65 4627 461 1,085,156 4263-9 | 424-8 | 59,741 254-73 25-38 
65—79 4602 512 602,764 7634:8 849-4. 33,080 252-56 28-10 
75—85 1687 155 183,869 9175-0 843-0 12,090 | 110-93 10-19 
85— 168 9 22,737 7388°8 | 395:8 eA Ole | 11-02 +59 
All ages | 15,589 1936 | 17,445,608 893-57 | 110-96 1,000,000 870-22 108-72 


Accordingly for 1911 we have corrective factors for: 
Cancer = ,F19,, = 870-22/893-57 = -97386. 
Diabetes = ,Fy9,, = 108-72/110-96 = -97981. 
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The accompanying Table IT gives under each year the total male population, 
the total deaths from cancer and from diabetes, the crude deathrates, the corrective 
factors and the corrected deathrates. From the latter the first six differences, 
A, to A,, were found and these were correlated for the two diseases. The following 
correlations were found: 


Correlation of Differences of Corrected Deathrates. 


Corrected Deathrates “ae ies ue See 47 + -958 + -008, 
First Differences coh ext wee Sea oes + :058 + -113, 
Second ,, Paty =. ae ans Aor + :043 + -130, 
Third = ote ae See be <a + :047 + -143, 
Fourth _,, Hee ah ae eh a6 + 051+ -154, 
Fifth . De see ace ae ae + 050-4 -164, 
Sixth s See Ae ae nae nae + °045 + °173. 


It will be seen that with the first differences we get an enormous drop in the 
correlation of the cancer and diabetes deathrates, i.e. from + -958 to + -058, and 
that very rapidly the correlation becomes steady* and insignificant. Thus with 
the removal of the time factor there appears to be no organic relationship between 
the prevalence of diabetes and cancer. A year in which there is an excess of diabetes 
deaths is not a year which will probably have an excess of cancer deaths. 


Now this result although absolutely conclusive as far as it goes, must not be 
stretched beyond its exact limitation. There is nothing to show that an increase 
of cancer at any given epoch will be accompanied by an increase of diabetes. 
It might be argued that it is conceivable that an increase of diabetes will be 
followed at an interval by an increase of cancer. I have not directly tested this, 
but it is to some extent indirectly tested by the method of variate differences. 
The sixth difference correlation correlates functions of the deathrates for seven 
years in the case of two diseases, and would be likely to indicate if there were any 
' such related succession. At the same time it must be remarked that previous 
investigators have all dealt with the contemporaneous deathrates of cancer and 


* This steadiness can be illustrated also by the method discussed in Biometrika, Vol. x. p. 272 and 
illustrated on p. 346. 


Values of o* 4m ,,/0? and their approach to 4-2, 


Am-ly 
m Theoretical Ratio =4 — 2 Cancer Diabetes Mean 
1 2-000 0°655 0°830 0°742 
2. 3:000 2°589 2:969 2:779 
3 3°333 3°431 3-386 3°409 
4 3°500 3°446 3°549 3°497 
5 3°600 3-719 3°627 3°673 
6 3°667 3°653 3°669 3°661 


It will be seen that there is a rapid approach to the theoretical values of the ratio. 


TABLE II. 
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Cancer and Diabetes corrected Deathrates and Corrective Factors 
England and Wales 1860-1913. 


| | 
| SI o o eo 
\ Pe or | 26 3 63 
ae Pes 72 02 0 cross 
SS | O88 BES | SS 
His Ad AS | oAA 
346 35-7 | 1-0542 37-6 
363 37:0 | 1-0541 39-0 
382 38-5 | 1:0537 40-6 
359 35-7 | 1-0536 37-6 
454 44-6 | 1-0538 47-0 
430 41-8 | 1-0541 44-0 
437 41-9 | 1-0545 44-2 
434 41-1 | 1-0551 43-4 
445 41-6 | 1-0559 4399 
497 45-9 | 1-0569 48-5 
470 42-9 | 1-0580 45-4 
541 48-8 | 1-0593 51-7 
513 45-6 | 1-0605 48-4 
550 48-3 | 1-0623 51:3 
528 45-7 | 1-0642 48-6 
614 52-5 | 1-0662 55-9 
582 49-1 | 1-0683 52-4 
663 55:1 | 1-0703 59-0 
669 54-9 | 1-0722 58-9 
645 52-2 | 1-0738 56-0 
624 49-9 | 1-0752 53-7 
769 60-7 | 1-0762 65:3 
733 57-2 | 1:0765 61-6 
838 64-7 | 1-0764 69-6 
886 67-7 | 1:0759 | 72-8 
896 67-8 | 10752 | 72-9 
978 73-2 | 1-0740 78-6 
1019 75:5 | 1:0724 80:9 
1070 78-4 | 1-0714 84-0 
980 71:0 | 1-0697 76-0 
1076 77-2 | 1:0679 82-4 
1082 768 | 1-0662 81-9 
1142 80-1 | 1-0642 85-2 
1169 81-1 | 1-0622 86-1 
1166 80-0 | 1:0603 84-8 
1262 85-6 | 1:0584 90-6 
1243 83-4 | 1-0560 88-1 
1303 86-4 | 1-0536 91-0 
1437 94-2 | 1-0510 99-0 
1448 93-9 | 1-0480 98-4 
1455 93-3 | 1-0445 97-5 
1565 99-2 | 1-:0408 | 103-3 
1462 91-8 | 1-0368 95-1 
1409 87-5 | 1-0322 90-3 
1612 99-1 | 1:0270 | 101-8 
1708 | 103-9 | 1-0214 | 106-1 
1736 | 1045 | 1:0153 | 106-1 
1738 | 103-6 | 1:0070 | 104-5 
1842 | 108-6 | 1-0022 | 108-8 
1898 | 110-8 9948 | 110-2 
1982 | 1145 | -9873 | 113-0 
1936 | 111-0 9798 | 108-7 
2064 | 116-8 “9716; | L135 
2122 | 118-8 9637 | 114-5 


oad | o o o 
a0 fo) we rs Sf ts 
| Se | #8 | S82] 888 | 282 
ee g a 3 i) 
1860 | 9,704,394 |} 2100] 216-4 | 1-0544 | 228-2 
1861 | 9,801,152 | 2180 | 222-4 | 1-0523 | 234-0 
1862 | 9,923,272 | 2256 | 227-3 | 1-:0498 | 238-6 
1863 | 10,046,909 | 2311 | 230-0 | 1-0480 | 241-0 
1864 | 10,172,089 | 2459 | 242.7 | 1:0465 | 252-9 
1865 | 10,298,826 | 2389 | 232-0 | 1-0452 | 242-5 
1866 | 10,427,146 | 2532 | 242-8 | 1-0444 | 253-6 
1867 | 10,557,066 | 2650 | 251-0 | 1-0439 | 262-0 
1868 | 10,688,600 | 2743 | 256-6 | 1-0440 | 267-9 
1869 | 10,821,775 | 2933 | 271-0 | 1-0446 | 283-1 
1870 | 10,956,608 | 2971 | 271-2 | 1-0460 | 283-7 
1871 | 11,092,620 | 3060 | 275-9 | 1-0478 | 289-1 
1872 | 11,242,495 | 3228 |] 287-1 | 1-0506 | 301-6 
1873 | 11,394,394 | 3337) 292-9 | 1-0540 | 308-7 
1874 | 11,548,346 | 3470 | 300-5 | 1:0575 | 317-8 
1875 | 11,704,378 | 3614 | 308-8 | 1:0615 | 327-8 
1876 | 11,862,519 | 3708 | 312-6 | 1-:0650 | 332-9 
1877 | 12,022,796 | 3950 | 328-5 | 1:0686 | 351-0 
1878 | 12,185,238 | 4164 | 341-7 | 1:0719 | 366-3 
1879 | 12,349,875 | 4149 | 336-0 | 1-0747 | 361-1 
1880 | 12,516,737 | 4423 | 353-4 | 1-0770 | 380-6 
1881 | 12,673,435 | 4611 | 363-8 | 1-0787 | 392-4 
1882 | 12,808,460 | 4685 | 365-8 | 1-:0795 | 394-9 
1883 | 12,944,923 | 4967 | 383-7 | 1-0800 | 414-4 
1884 | 13,082,837 | 5346 | 408-6 | 1-0804 | 441-5 
1885 | 13,222,216 | 5495 | 415-6 | 1-0803 | 449-0 
1886 | 13,363,079 | 5754 | 430-6 | 1-0798 | 465-0 
1887 | 13,505,441 | 6262 | 463-7 | 1-0794 | 500-5 
1888 | 13,649,314 | 6284 | 460-4 | 1-0787 | 496-6 
1889 | 13,794,721 | 6891 | 499-5 | 1:0777 | 538:3 
1890 | 13,941,671 | 7137 | 511-9 | 1-0766 | 551-1 
1891 | 14,092,535 | 7294 | 517-6 | 1-0754 | 556-6 
1892 | 14,252,190 | 7547 | 529-5 | 1:0740 | 568-7 
1893 | 14,413,657 | 7908 | 548-6 | 1-0728 | 588-5 
1894 | 14,576,948 | 8077 | 554-1 | 1-0715 | 593-7 
1895 | 14,742,091 | 8628 | 585-3 | 1-0698 | 626-2 
1896 | 14,909,104 | 9216 | 618-1 | 1-0682 | 660-3 
1897 | 15,078,010 | 9573 | 634-9 | 1-0662 | 676-9 
1898 | 15,248,823 | 9932 | 651-3 | 1-0639 | 692-9 
1899 | 15,421,578 | 10337 | 670-3 | 1-0614 | 711-5 
1900 | 15,596,283 | 10475 | 671-6 | 1-0583 | 710-8 
1901 | 15,769,412 | 10891 | 690-6 | 1-0550 | 728-6 
1902 | 15,933,658 | 11098 | 696-5 | 1-0505 | 731-7 
1903 | 16,099,612 | 11799 | 732-9 | 1-0453 | 766-1 
1904 | 16,267,291 | 12086 | 743-0 | 1-0390 | 772-0 
1905 | 16,436,707 | 12470 | 758-7 | 1-0318 | 782-8 
1906 | 16,607,890 | 13257 | 798-2 | 1-0238 | 817-2 
1907 | 16,780,848 | 13199 | 786-6 | 1-0150 | 798-4 
1908 | 16,955,609 | 13901 | 819-8 | 1-0056 | 824-4 
1909 | 17,132,182 | 14263 | 832-5 | 0-9953 | 8928-6 : 
1910 | 17,310,586 | 14843 | 857-5 | 0-9846 | 844-3 | . 
1911 | 17,490,847 | 15589 | 893-6 | 0-9739 | 870-2 | : 
1912 | 17,672,985 | 16188 | 916-0 | 0-9620 | 881-2 | : 
1913 | 17,857,014 | 16918 | 947-4 | 0-9580 | 907-6 : 


200 Correlation of Cancer and Diabetes Deathrates 


diabetes. Of course such an investigation as the present does not provide any 
measure of whether persons contracting diabetes are more liable to die of cancer, 
for the deaths would be registered as deaths from cancer, yet this not improbably 
is the vital problem*. The result reached, however, is consistent with a continuous 
increase of both cancer and diabetes not essentially related organically with each 
other. In this case Maynard’s results for the United States would mean that 
in that huge population the towns and states were in heterogeneous stages of 
historical development, whether as to medical training, accuracy of record or 
cultural conditions, while the non-appearance of the high correlation in the Swiss 
data would merely signify that the units chosen there were homogeneous in such 
characters, and this considering the size of Switzerland is not improbable. In 
other words some American districts present with regard to cancer and diabetes 
the English conditions of 1870 and others the English conditions of 1910. The 
time correlation of the English deathrates may thus correspond to the geographical 
correlation of the American deathrates. If so, since, when the time factor is 
removed, the English diabetes and cancer deathrates show no organic correlation, 
we cannot use the English material to support the view that the American relation- 
ship is organic in character. The correlations that Maynard has indicated between 
both cancer and diabetes deathrates and the spread of insanity, suicides and the 
newspaper press, seem indeed to indicate not an organic relationship of the two 
diseases, but as Maynard himself has suggested a wider range of both with 
advancing cultural conditions. 


* The enquiry as to whether cancer patients are suffering or have suffered from diabetes should 
always be made and the answer recorded. 


A CONTRIBUTION TO THE PROBLEM OF 
HOMOTYPOSIS 


DATA FROM THE LEGUME CERCIS CANADENSIS 
By J. ARTHUR HARRIS, Pu.D., Carnegie Institution of Washington, U.S.A. 


I. Intropuctory REMARKS. 


In 1905, I began the collection of series of data for a comprehensive study 
of the problem of intra-individual or “homotypic”* correlation for fertility and 
fecundity characters in plants. Such characters were chosen not merely because 
of particular interest in that subject in general but because of the conviction 
that the intra-class and inter-classt correlation methods applied to the problem 
of fertility and fecundity might yield results of considerable physiological interest. 
Since that time, circumstances have prevented my carrying out the work along 
the lines originally laid down. The constants given here are drawn from notes of 
the work then done. 


II. MaAtvErtAts. 


The data presented are exclusively those for pod length /, number of ovules 
formed o, number of seeds developing s, and number of ovules failing to develop 
jf, in the small arborescent legume, Cercis Canadensis. Three collections are 
involved: one from Meramec Highlands, near St Louis, Mo., made in the autumn 
of 1905; one from the vicinity of Lawrence, Kansas, taken in 1905; one from 
the neighbourhood of Sharpsburg, Athens County, Ohio, gathered in 1908. These 
materials have already been considered in dealing with other problems quite 
distinct from the present one; the reader seeking more comprehensive information 
with regard to the materials used should consult the papers cited below ft. 


For the Meramec Highlands collections two sets of symmetrical homotypic 
correlation tables were prepared. 


* K. Pearson, and others: Phil. Trans. B, Vol. cxcvi. pp. 286-288, 1901. 
+ Biometrika, Vol. 1x. pp, 446-472, 1913. 
t Bot. Gaz. Vol. L. pp. 117-127, 1910; Bull. Torr. Bot. Club, Vol. xii. pp. 243-256, 1914 
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I used the first 26 pods taken from 112 trees. The desirability of working 
with a larger series of data from individual trees then appealed to me and a table 
comprising the first 100 pods from the 60 trees from which that number could 
be secured was undertaken. For both of these series all the possible direct and 
cross homotypic correlations were ascertained for the fertility characters. 


The Kansas series comprised only 22 individual trees, from each of which 
100 pods were counted. The Ohio series included 150 pods each from 26 trees. 
For these two series all possible homotypic correlations for the fertility characters 
have been determined. 


Ill. Meruops. 


The three direct intra-individual or homotypic correlations are: 
(1) Ovules of first pod and ovules of second pod, Tables I, II*. 


(2) Seed developing in first pod and seed developing in second pod, Tables 
TAL. 


(3) Ovules failing in first pod and ovules failing in second pod, Tables V, VI. 


The cross-homotypic relationships are: 

(4) Ovules of first pod and seeds developing in second pod, Tables VII, VIII. 
(5) Ovules of first pod and ovules failing in second pod. 

(6) Seeds developing in first pod and ovules failing in second pod. 

Tables for two of the cross correlations are unnecessary. Thus for the relation- 


ship between the number of ovules in the first pod and the number of ovules failing 
in the second pod, 7,,;,, the slope of the regression line is clearly 


OF, %o, Os, 
lor fe Tat To102 ein To1s2 2a): 
5 rn 95, Oo, Oo, 
or in terms of correlation 
Fo O59 . 
pi EN = ie) Gdodndsadodouns Sdonos d000ce (1) 
Of, OF, 


Again, for number of seeds in the first pod and number of ovules failing to develop 
in the second pod the slope of the regression straight line is 


OF, 5 O55 
UGiE = am UG ee on UG) SS 
Os, S1 S1 
Oo, Os5 oe 
whence Ysifo = Vs109 LE he meairennne noo sesocberad one (11) 
OF, OF, 


In calculating the correlations for length of pod in the Meramec Highlands 
series and for the fertility of characters of the Ohio and Kansas series I have had 
recourse to the direct and cross intra-class correlation formulae (v)-(xil) of 


* The numbers refer to the tables for the Meramec Highlands series. 
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Biometrika, Vol. 1x. pp. 450-452, 1913. In present notation the product summa- 
tions are for the several correlations: 


HOM epee mS EON) — IS: [a(O') LING sues 0st ceeencevenaveneae (iii) 
Pe eS 5) SE Ge MIN ees acateenetanss (iv) 
eNO Of) IP [CPA IPN gee ssenesoeh sons eesduese cnet (v) 
ee SIS (oF) 21 (5) — SLD Core IN, sce co ccszsccse on. (vi) 
Pea AON (OG ) af) — SO VV IIN 2.8. cst reaseeses (vii 
Peet eas [ee (S:) auf) — SL (SF RIN cco eecensesdenw en (viii) 


where N is the population of individuals resulting from the n(n — 1)-fold 
weighting, & denotes a summation for the individual pods of a class and S a 
summation for the classes (individual trees). For data see Table IX. 


In the first three of these the negative sign term of the formula is merely the 
second moment of the unweighted population and in the fourth it is the product 
moment of the correlation surface 7,,—the “organic” relationship between 
ovules and seeds of the same pod—from which the three second moments, & (0’?), 
x (s’2), X(f’?), may be calculated. The 5th and 6th cross correlations may be 
determined from the other correlations by (i) and (ii), or since f=o—s the 
minus terms in (vii) and (vill) are given by 


S[X (o'f')] = S[% 0?)] — S[2 0's’), 
S[X (s'f')] = S[% (s'0')] — S[% (s”)], 


which may be most easily calculated from the published tables for r,,, since all 
classes are equally large. 


IV. PRESENTATION OF Data. 


The constants for both direct and cross homotypic correlations are shown for 
all series in Table A. 


The reader, in examining these constants, will remember that the two Meramec 
Highlands series were taken from the same habitat and in the same year. They 
are known to differ only in (i) the smaller number of pods from each individual 
in the first series, (i1) the fact that the first series contains pods from 50 individuals 
which do not occur in the second lot. The Ohio and Kansas collections on the 
other hand represent materials of the same species from localities several hundreds 
of miles distant. Unfortunately, both of these contain too few individuals to be 
fully trustworthy. 


There can be no reasonable question of the statistical trustworthiness of all 
the direct homotypic correlations. The lowest is that for ovules failing per 
pod and this is in all cases six or more times its probable error. The coefficients 
To,029 Vsyso are from 25 to 45 times as large as their probable errors. For the most 
part the cross correlations may also be regarded as statistically trustworthy. 
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TABLE A. 


Homotypic Correlations* for Cercis. 


Meramec Meramec Sharpsburg, Lawrence, 
Combination | Highlands, Mo., Highlands, Mo., Ohio, Kansas, 
112 Trees 60 Trees 26 Trees 22 Trees 
Ovules of First Pod 
and Ovules of Second 
Pod ... *3524 + -0109 *3527 + -0076 | -2768 + :0100 | -3999 + -0121 


Seeds of First Pod and 
Seeds of Second Pod -2677 + -0116 -2306 -- -0082 -1798 + -0104 | -2019 + -0138 
Ovules Failing in First 
Pod and Ovules Fail- 
ing in Second Pod -0979 + -0123 -0684 + -0087 -1756 + -0104 | -0858 + -0143 
Ovules of First Pod 
and Seeds of Second | 
Pod ... pi Pe -2758 
Ovules of First Pod 
and Ovules Failing 
in Second Pod... ‘0178 + -0125 °0301 -- -0087 -0762 + 0107 | -1195 + -0142 
Seeds of First Pod | 
and Ovules Failing 
in Second Pod... | — :0570 + -0125 | — -0261 + -0087 | — -1095+ -0107 | -0344 + -0144 


- 0115 *2622 + -0081 -1173 + -0106 | -2574 + -0134 


The regression equations for the two Meramec Highlands series appear in 
Table B. The regression lines for the direct relationships 7,,,., 7s,s. ate given 
in Diagrams I and II, and for the cross relationships 7,,,,, 7s;o, 12 Diagrams III 
and IV. 


TABLE B. 


Regression Equations for Meramec Highlands Collection. 


2912 Pods 6000 Pods 
0, = 3-0571 + -35240, | 0, = 2-9925 + -35270, 
8, = 2-9563 + 2677s, | 8, = 3-0018 + -2306s, 
fo = -6171 + -0979f, | f, = -6722 + -0684f, 
S, = 2-4485 + -33650, | s, = 2:3982 + -32520, 
fo = °6089 + -01590, | f, = -5943 + -02750, 


f, = 8671 — 0453s, | f, = -7965 — -0192s, 


Inspection indicates that the regression of the ovules of the second pod 
(Diagrams I and II, upper lines) on the ovules of the first and of the seeds of the 
second pod (Diagrams IIT and IV, steeper lines) on the ovules of the first are 
linear. It is not so clear in the case of ovules or seeds of the second pod on seeds 


* All probable errors are calculated on the basis of the actual, not the weighted, number of pods. 


bE Sale U8 


Ovules of Second Pod. 


Seeds developing in Second Pod. 


J. ArtHUR Harris 205 


Diacram I. Direct Homotyposis. Cercis Canadensis (Meramec Highlands, Ist Series). 
Ovules on Ovules and Seeds on Seeds. 


Tablet 


Ovules of Second Pod. 


Table fil Ovules of 15 Pod 


Seeds developing in Second Pod 


Seeds developing in First Pod. 


Diagram II. Direct Homotyposis. Cercis Canadensis (Meramec Highlands. 2nd Series). 
Ovules on Ovules and Seeds on Seeds. 


Table // 


Table lV Ovules of first Pod 


1 2 3 4 5 6 7 8 
Seeds developing in First Pod. 


Ovules in Second Pod. 


Dracram III. Cross Homotyposis. Cercis Canadensis (Meramec Highlands, 1st Series). 
Ovules on Seeds and Seeds on Ovules. 


Table Vil 


Ovules in Second Pod. 


O 1 2 3 4 5 6 7 
Seeds developing in First Pod. 


Driacram IV. Cross Homotyposis. Cercis Canadensis (Meramec Highlands, 2nd Series). 
Ovules on Seeds and Seeds on Ovules, 


Table VIM 


0 1 2 3 4 5 6 7 8 
Seeds developing in First Pod. 
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of the first, although even here (see Diagrams I and II, lower lines, IIT and IV, 
less steep lines) the deviation from linearity is not great*. 

Returning now to the constants and considering them in detail, we note that for 
all localities the direct homotypic correlations are highest for the number of ovules 
per pod and lowest for the number of ovules failing}. The averagest are 


Ovules and Ovules ... = Pe as -3430 
Seeds and Seeds ts see ae ie 2164 
Ovules failing and Ovules failing ... os “1198 


The cross relationships are highest for ovules formed and seeds developing per 
pod; much lower for ovules formed and ovules failing to develop; lowest for 
seeds developing and seeds failing to develop—three of the four values for 7,5, 
are negative in sign. 


The averages are: 


Ovules and Seeds sai seen + -2168 
Ovules and Ovules failing Ae ae + 0712 
Seeds and Ovules failing ei ee — -0440 


From these data it appears that the individuality of the trees is more marked 
for number of ovules per pod than for either number of seeds matured or number 
of ovules failing per pod. 


Number of ovules per pod may be regarded in comparison with number of 
seeds developing and number of ovules failing to develop as an independent 
variable§. 


The number of seeds per pod is absolutely limited by (i) the number of ovules 
formed in the pod, and (ii) the number of these which are prevented from developing 
by peculiarities inherent in ovum or sperm, by accidents of fertilization, inadequacy 
of food supply, or other unknown causes. Or conversely, the number of ovules 
failing per pod is dependent upon the number of ovules formed and the number 
of seeds developing. Thus the number of ovules per pod determines within certain 
limits the number of seeds developing, or the number of ovules failing, per pod; 
but in the nature of things it cannot be influenced by either of these. 


Now since the three characters under consideration are correlated, i.e. since 
Toss Tors Yop have been found to have sensible values, it is clear that some homotypic 
relationship would arise for seeds because of the correlation for ovules, or for 
ovules failing because of the correlation for ovules and seeds. Thus a statistically 
significant value of an intra-individual correlation for seeds per pod does not 

* In the case of the relationship for ovules and ovules in the 60 trees from Meramec Highlands 
I have found the raw value of 7 to be -35336, exceeding r by only -00064. 

} In the Ohio series 7, ,, is very low; 1,8, and rp, f, are not sensibly different. 


t The constants for the 112 trees from Meramec Highlands are alone used in obtaining the general 
averages. 

§ This does not mean that in any given pod all three of these characters are not dependent upon 
the same ultimate causes, but merely that in a proximate sense number of ovules per pod is physio- 
logically independent while number of seeds and number of ovules failing are physiologically dependent. 
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necessarily mean that there are any specific biological factors influencing the 
number of seeds developing or the number of ovules failing in all of the pods of 
the same individual in a similar manner. The observed values of correlations 
for (semi)dependent characters may be merely the resultant of independent 
variables with which they are correlated. 


To remove the influence of 7,,,. upon 71;,;, We have recourse to the partial 
correlation coefficient for two variables, i.e. 0, and o,, constant. In our notation 
this ‘is 


srs (1 = Toros") = Torsiois: — Yoosi%ons2 + Fou ("015.7008 + %o187 0051) 
88 VT tno — tana fora t 2 oposTosnToan) WL — Foye Fons? Tora + YosFovesTone) 
102 2S1 1 $1 102° 0181 1 1 rf 12 S2, 182 102° 01 82" 0282 
But since we are dealing with symmetrical tables we may write* 
tyy (1 — Toto ) = 2Mostors + Yo’! (Tos” + To's”) 

a 1 = ryg = Tog? — = + 2K o'o'TosTo's! 

where the correlations without dashes are those for the same pod and those with 
dashes are for the relationships between different pods. 

In applying this formula to the actual data, the values of ,, for the individual 
pods are essential. These have been deduced from Tables X, XI below and 
Tables VI, VII of a former paperf. 

Meramec Highlands. 
6000 pods, 1,5 
2912 pods, 155 

Sharpsburg, Ohio, 7,, 

Lawrence, Kansas, 1,, 


9102 


o’o'N s's! 


“6855 + -0046, 
6936 + -0065, 
-4553 + -0086, 
6032 + -0091. 


: 6 é ® 
The partial correlation coefficients for seeds per pod for constant numbers of 
ovules per pod are: 


I 


I 


Series ofo'l's's! Tes votes 
8/3’ 
Meramec eee 
112 Trees at -1063 -2677 °397 
60 Trees ae -0700 -2306 304 
Sharpsburg, Ohio . 1645 -1798 915 
Lawrence, Kansas -0582 -2019 288 


These give a mean value of -0998 as compared with the mean relationship 
uncorrected for the influence of ovules per pod, ry, = -2164. 


Apparently, there are specific physiological } factors which tend to differentiate 


* Biometrika, Vol. vu. p. 328. For original see K, Pearson, Phil. Trans. A, Vol. cc. p. 31, 1902. 

+ Bull. Torr. Bot. Club, Vol. xii. pp. 243-256, 1914. 

{ Under the term physiological as used here are included (a) the ecological factors which determine 
whether an ovule shall receive a sperm, (0) the nutritional factors which determine the availability of 
food materials, and other environmental prerequisites for development, and (c) the innate vigour or 
other physiological characters of the individual which determine whether a, fertilized ovule shall develop 
into a seed. 
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the trees with respect to seed production, but the bulk of the interdependence 
for seeds seems to be merely the resultant of 7, and r,,*. 

Relative net individuality of the trees with respect to capacity for seed pro- 
duction is highest in the Ohio series. This is due to the fact that both r,,,, and 
7, are lower for this series than for any other. 

The frequency distribution of length of pod measured to the nearest millimetre 
in 50 pods each from 60 trees is shown in Table XII. The summed length for 
the individual trees, i.e. mean length x 50, is given in Table XIII. By the direct 
intra-class formulae I find 

ity = 4784 + -0095, 
a homotypic value distinctly higher than those found for the fertility characters. 


V. Discussion or RESULTS. 


The only homotypic constants available for fertility characters in plants are 
those furnished for several species of- Leguminosae by Pearson and his associates f. 
They give the direct correlations as shown in Table C. 

TABLE C. 
Homotypic Correlations for Fertility Characters. 


Species of Legume Ovules ere Ripe Seeds 

| Cytisus Scoparius — _- 4155 
| Lotus Corniculatus _- = +2354 
| ” ” een —=. -1884 
Lathyrus odoratus -2182 -2679 0830 

” 29 -3658 +1759 -2091 
Lathyrus Sylvestris -1695 -1376 -2184 

” ” = a -1877 

Vicia Faba ae +1724 -1493 -1243 
Vicia Hirsuta S68 +2315 -1827 -2077 


In addition to these values I have found in a short series of only 12 trees 
of the arborescent legume Robinia Pseud-acacia worked out as an illustration of 
method{ the values 7,,,. = -452, 15,6. =°449, 1o1s, = ‘080- For a short series 
(23 plants) of Cytisus Scoparius§ I have found r,,,, = -198. 


The average homotypic values for ovules in Cercis are distinctly higher than 
the comparable values for other species hitherto adequately investigated. The 


* In Sanguinaria (Biometrika, Vol. vit. p. 328) the correlation for the number of seeds on the two 
placentae of the same fruit seems to be chiefly due to physiological factors. In this case an organic 
correlation is superimposed upon a homotypic. The point may profitably be discussed comparatively 
when other data now in hand are completely analyzed. 

+ K. Pearson and others, Phil. Trans. A, Vol. cxcvu. pp. 364-379, 1901. 

{ Biometrika, Vol. 1x. pp. 456-458, 463, 1913. 

§ Amer. Nai. Vol. xiv. pp. 566-571, 1911. Possibly this material is of closely selected ancestry. 
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mean values for seeds are about the same as those found by English investigators. 
The mean correlation for ovules failing is lower. 

The homotypic correlation for seeds matured per pod is statistically largely 
a resultant of the homotypic correlation for ovules and the “organic” correlation 
for ovules and seeds of the same pod. Thus homotypic correlation for seeds per 
pod does not entirely disappear when correction for 7,,,, and 7,, is made by the 
application of partial correlation formulae. There are, therefore, (proximately) 
independent ecological and physiological factors tending to differentiate individuals 
with respect to capacity for seed production. 

The general rule for fertility characters in Leguminosae seems to be for the 
maximum value of the homotypic correlation to be that for number of ovules 
per pod, much lower values are found for seeds matured or abortive ovules per 
pod. I believe that the evidence also points in the direction of a lower correlation 
for ovules than for the more truly vegetative characters of the plant. Thus 
Pearson’s average values for r,,,, are far lower than his average for leaves and 
other similar organs. The result given above for length of pod r,,;, 18 practically 
-500, and distinctly higher than 7,,,, = :352 from the same habitat. 


102 


TABLE I. 
Ovules per Pod. 
2 3 4 5) 6 7 Totals 
3 l 
a 2 18 Tel 19 7anls 46 ee © 375 
is 3 111 836 2382 | 942 2300 eats 4525 
2 4 197 | 2382 | 11770 | 8642 2599 160 | 25750 
5 46 942 | 8642 | 12134 5613 | 598 | 27975 
8 6 3 239 2599 5613 3742 | 529 | 12725 
= 7 0 15 | 160 598 529 | 148 1450 
jo) | 
Totals 375 | 4525 | 25750 | 27975 | -12725 |. 1450 | 72800 
i TABLE II. 
Ovules per Pod. 
2 3 4 5 6 7 | & | Totals 
ro | iB aye Pe | | | 
ee 2 98} 1025 1775 419 49; O| Of 3366 
eH 3 1025 | 10608 | 25134 | 11257| 2300) 111 | 5] 50490 
5 4 1775 | 25184 | 101272 | 74059 | 18242 | 1296 | 31 | 221859 
al 6 419 | 11257 | 74059 | 92240 | 37849 | 4012 | 142 | 219978 
PAG 49 | 2300 | 18242 | 37849 | 25458 | 3833 | 181 | 87912 
aor cad O| 111} 1296] 4012] 3833) 712) 35] 9999 
EC gen.8 0 5 | 31 142 || 18g || 5 bahay? 396 
3 | | | : 
Totals se | 50490 | 221859 | 219978 | 87912 | 9999 | 396 | 594000 


Seeds per Pod. 
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TABLE III. 
Seeds per Pod. 
TA, Oe Pe 3 4 5 | 6 | 7 | Totals 
3 1 16| 95 179 | 207 94/ 9| O| .600 
aw 2 95 | 710| 1596) 1691| 693 152, 13] 4950 
3 179 | 1596 | 5106 | 6260 | 2975 747 | 62 | 16925 
a 4 207 | 1691 | 6260 10164) 6106 1821 151 | 26400 
5 94 | 693 2975 | 6106 | 4986 | 2109 | 212 | 17175 
& 6 9 152 747 | 1821 | 2109 1096 | 166] 6100 
2 7 Galler e109 62 | 151 212 | 166| 46] 650 
TM | 
Totals | 600 4950 | 16925 | 26400 | 17175 6100 | 650 | 72800 
TABLE IV. 
Seeds per Pod. 
elas? 3 Aes | 085 6 7 | 8& | Totals 
| | 
fi 394 | 1467/| 2965) 2861 1412 | 283] 23] Of 9405 
2 1467 | 7224 | 17224 | 16294] 7546 | 1520| 104| 2] 51381 
3 2965 | 17224 | 47594 | 52375 | 27161 | 6032] 388| 8 | 153747 
4 2861 | 16294 | 52375 | 71576 | 45505 | 12278 | 1041 | 30 | 201960 
5 1412 | 7546 | 27161 45505 | 36836 | 13121 | 1304 | 72 | 132957 
6 283 1520! 6032 | 12278 | 13121 6334] 853 70] 40491 
7 23| 104 388 | 1041 1304 | 853] 134] 14] 3861 
8 0 2 8 30 FOS MeO id 969 198 
| 
Totals | 9405 51381 | 153747 | 201960 132957 40491 | 3861 | 198 | 594000 
TABLE  V. 


Ovules failing per Pod. 


Ovules Failing per Pod. 


Totals 


19140 | 12531 


12531 | 9536 
3620 | 3182 | 1206 | 313 
756.| 692| 313] 134 
91 OF weasel «30 

12 


36150 | 26050 | 


0 1 2 


12 1 0 


Totals 


36150 
26050 
8375 
1925 
275 
25 


72800 


14—2 
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Ovules Failing per Pod. 


TABLE VI. 
Ovules Failing per Pod. 
0 1 2 3 | d 5 | Totals 
——— 
0 143232 | 101090 | 32493 | 6712 | 1408 | 185 | 285120 
1 101090 | 78862 | 27035 | 6332 | 1227 | 185 | 214731 
2 32493 | 27035 | 10220] 2735 596 | 82] 73161 
3 6712 6332 | 2735 884 233 | 383] 16929 
4 1408 1227 | 596 233 | 92 8 3564 
5 185 185 | 82 8 2 495 
| | 
Totals | 285120 | 214731 | 73161 | 16929 3564 | 495 | 594000 
\ i 
TABLE VII. 


Seeds per Pod. 


| Totals 
= 
a 2 375 
i 3 4525 
a 4 25750 
3 5 266 | 27975 
& 6 12725 
Ss 
5 7 1450 
=), 
| Totals 650 | 72800 
TABLE VIII. 
Seeds per Pod. 
1 2 3 oT GS 6 7 | 8 | Totals 
2 98 658 1407 953 | 234 16 0 0 3366 
3 1158 | 7868 | 19077 15761 | 5710 882 34 0} 50490 
4 4206 | 23467 | 67408 | 78277) 40285 7727 | 485 4 | 221859 
5 3066 | 15160 | 49891 | 76785 | 56232 17265 | 1525 | 54 | 219978 
6 s00 | 3923 | 14687 | 27312 | 27097 | 12491 | 1492 | 110] 87912 
Wy, 72 294 1238 | 2793} 3260) 2005] 309} 28 9999 
8 5 in 39 | 79 139 105 16 2 396 
| | 
Totals | 9405 | 51381 | 153747 | 201960 | 132957 40491 3861 198 | 594000 
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Seeds per Pod. 


TABLE IX. 
Kansas Series Ohio Series 

: Total | Total Total Total 

Tree Ovules | Seeds Ovules Seeds 

=(0’) | d(s’) >(0’) > (s’) 

o> 
1 522 461 740 592 
2 471 377 690 578 
3 420 | 362 86] 731 
4 473 412 791 605 
Sem 2400 351 813 576 
6 480 413 8381 504. 
"i 456 368 826 477 
8 591 469 809 519 
9 454 346 753 329 
10 574. 491 843 546 
11 429 380 815 490 
12 487 382 879 633 
13 482 407 762 606 
14 552 465 787 453 
15 502 418 769 566 
16 473 411 863 _ 632 
17 396 338 746 641 
18 564 408 1039 (Bul 
19 614 521 707 510 
20 495 437 962 731 
21 536 435 886 696 
22 444 403 788 649 
23 — — 806 692 
24 | — — 800 632 
Ns, a eee Ea 925 651 
26 — — 932 612 
TABLE X. 


NSD GH KR Oo 0 


|: 2 3 Ae eno) (ae | ey oree 
| 
| 


120 sak 
380 553 
147 421. | 510 
29 80 169 
1 ee | 8 
| 
677 1056 | 687 
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TABLE XI. 


Seeds per Pod. 


| Ear? 3 | 4 | 5 6 7 8 | Totals 
| | 
ic 2 an ws - 
Ay 3 34. 176 300! 4/4 _ 
wi 4 37 226 | 885 | 1093 
5, 5 17 71 310 784 1040 
a 6 4 14 54 1.8 * 168 288 375 =3 = 
< 7 ee 1 4 9 15 33 39 an 
5 8 | | eee 1 ee 2 
o) 
| Totals | 95 | 519 | 1553 | 2040 | 1343 | 409 | 39 | 2 


TABLE XII. 
Length of Individual Pods. 


eee! f L if L f L f L a L ip 
ee ee | 2 =e) 
45 2 55 13 65 141 75 213 85 53 95 — 
46 1 56 12 66 83 76 152 86 28 96 2 
47 1 57 19 67 104 77 122 87 26 97 3 
48 = 58 | 19 68 128 78 117 88 18 98 2 
49 ] 59 13 69 56 79 53 89 3 99 1 
50 ] 60 54 70 222 80 187 90 21 100 3 
51 2 61 39 71 113 81 82 91 u 101 3 
| 52 3 62 54 72 162 82 86 92 11 102 il 
| 53 4 63 51 73 154 83 64 93 5 103 = 
| 54 5 64 63 74 158 84 56 94 1 104 2 


TABLE XIII. 
Total Lengths of 50 Pods for 60 Individual Cercis Trees. 


| 3361: | 3587 | 3004 | 3499 | 8598-37389 | 3418 | 3848 | 3202 

| 3679 8794 3352-3707 | -3615 | 8463-3945 | 3388 | 4026 | 3642 
393 4546 | 3654 3658-3823 | 3484 | 3806 | 3977 | 3244 | 3459 
3548 | 3748 | 3343 | 3686 | 3619 | 3978 | 3725 | 3544 | 3223 | 4086 
3693 | 3725 | 3168 | 3873 | 3889 | 3279 | 3502 | 3053 | 3765 | 4031 
3883 | 3850 | 3770 | 3921 | 3497 | 3791 | 3749 | 3603 | 4117 | 3590 
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ON THE PROBABLE ERROR OF A COEFFICIENT 
OF CONTINGENCY WITHOUT APPROXIMATION. 


By ANDREW W. YOUNG, M.A. anp KARL PEARSON, F.R.S. 


(1) Introductory. 


There have been two memoirs dealing with the probable error of a coefficient 
of contingency, namely that by Blakeman and .Pearson in 1906* and that by 
Pearson in 1914}. In the former paper the authors started from the expression 
for the mean square contingency 


and varied ,,, n,, and n,, but neglected the squares and products of these varia- 
tions. The result was lengthy, and the arithmetical work laborious. In 1914 
Pearson gave reasons for considering n,, and n,, as constant during the sampling 
and got a much simpler value for o,,. The result in actual numerical cases did 
not differ widely from the much more elaborate formula of the earlier memoir. 
Recent work in other directions has, however, shown that caution must be used 
in neglecting the square and product terms of the variations due to random sampling, 
and the object of the present paper is to consider the variation of 4? on the hypothesis 
of Pearson’s 1914 note but without approximation. 

Let a population of size M be grouped into c divisions—for example, the cells 
of a contingency table—and let the contents of the sth division be m,. Let a 
sample of size V be taken at random from the population and let n, be the contents 
of the sth division according to the same grouping. 

We shall here consider the variation of the quantity 6? defined by 

; nN,” F 
1+¢7=8 ae Pee Aaah ieee Ree e ee (i), 

where A, is a number connected with the sth division and is for the present restricted 
only by the condition 


ISAC NG) at Sete ane aati eta des sana cuetetennvs eas (ii), 
—a condition which enables us to write 
a a {(M@= — As)?) 
= 5 { WA. (rte (111) 


as equivalent to (i). 


* Biometrika, Vol. v. p. 191. + Biometrika, Vol. x. p. 570. 
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These undetermined numbers A, are thus in general of the nature of weights and 
may be chosen in a variety of ways. The most important particular case is that 
of the population being grouped in a contingency table with, say, two variates. 
MyM, 
He 
and m, being as usual the marginal totals of the wth row and vth column of the 
population M, ¢? will be the mean square contingency*. Other cases will be 
discussed later. 

The object of the present paper is to investigate the variation of the quantity 
g? as determined from the samples of the population. We take the numbers A 
to be a property of the whole population and accordingly to have no variation as 
long as the size N of the samples is constant. It is true that in most cases in practice 
there will be only one sample and that the values of the numbers A will have to 
be deduced from that sample and will therefore deviate from the values which 
would be used if the sampled population were known. But what we are seeking 
is the variability of the samples on the understanding that the distribution of 
the whole population is definite although in practice we know only the approxima- 
tion to that distribution which is given by our sample. If we had wanted the 
variability of the calculated values of 6% deduced from a large number of random 
samples, then we should have taken into account the variation of the 1’s as well 
as of the n’s. In this lies the difference between the discussions in the two earlier 
papers of 1906 and 1914. 


This investigation follows that of the second paper, but we shall here give the 
full expressions without approximation, i.e. without neglecting the square of dn, 
as was done in 1914. It will appear from the numerical examples worked out 
later that this squared term makes a fairly great difference and, even if this were 
not so, it is always preferable to have such formulae in full in order to decide the 
legitimacy of neglecting any terms. This is especially the case in statistical theory 
where neglect of the later terms of a Taylor expansion often leads to false results. 


The sth division will be, say, the cell (uw, v) and if A, be taken to be N My 


(2) Mean Value of ¢?. 
Let $2 be the mean value of ¢? and let 7. be the mean value of n,, i.e. the 
value which would be given by taking a very large number of samples. Then we 


can write 
Ne Ms 


Nes 
Also if we define 5¢? and 6n, by the equations 
2 = f+ S42, 
N, =n, + Sng, 
7 2 n 2 
(EE Se tal +28 Gr) LO (SR) ee (iv). 
* Drapers’ Research Memoirs, Biometric Series, I. On the Theory of Contingency, etc., Cambridge 
University Press, 1904; Biometrika, Vol. v. p. 191, 1906; Biometrika, Vol. x. p. 570, 1914. 
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Sum all such equations for a large number of samples and divide by the number 
of samples. Then, since Mean 6n, = 0 and Mean 667 = 0, we have 


1+¢=S Grr) + MeanS (Ge) Peat rae vet (v). 


The expression Mean (8n,)? is typical of several which we have to use in what 
follows and it will be useful to state or prove all the needful formulae before 
proceeding further. 


(3) Formulae regarding Products of Deviations. 


The deviations 6n, arrange themselves according to a hypergeometrical series 
and the moment coefficients of this series are known to be* 


Py = xu N pg 
Bs = X1X2N pq (p — 9) 
Ha = Xi N pg (3x3N pg + xa) 


N-1 
where yal alae eee 
(ONS), oa oe ee ny eC (v1), 
eae | REO 


Xo = (1- x) {1 A i = oa | He st 
2 


and in the present case 
LN 
cM aN : 
Sees i { A SECS er Ree eines Orr rae (vil) 
M N | 


When M is very large as compared w.th N, as in the majority of cases in 
practice, we may write xy; =x,=x3=x,= 1. 


We can now make use of these formulae to derive the following: 


(2) Mean (6n,)*. This is p, in the notation of (vi) and 


= Xs ( = =) ESOC SSE COSONCL CECE ere (a). 


(0) Mean dn,dn, where s and s’ differ. Suppose first that 5x, remains constant 
and investigate the mean of dn, for this constant value of 5n,. Now the distribution 


* Pearson, Phil. Mag. 1899, p. 239; Biometrika, Vol. v. p. 174. 


218 On the Probable Error of a Coefficient of Contingency 


of ny for n, constant is clearly given by the sub-hypergeometrical series with 
N’ = N — ii, — bn, as total population and 


em ng 
PN ag 
1 ar ae Ny 
t= Meaeeane? 
so that the Mean dn, for dn, constant is 
Ny dn, Ny 
eee Sampara ee oe ON ise 
(N — n, — 5n,) Nea Nig Neate 
Hence 
dn,n 
Mean 6n,dny = — Mean Ons. = 7, 
= — Mean 6n2 seis 
“"N — it, 
A,\ ity 
=—y,n, (1 — = — from (a 
xalie(1— 3) qt from (a) 
igiiy 
i X1 a a Cee ee cer ec ecee eres ere esee eee eeecceeee essere (b) 
(c) Mean (6n,). Directly from (vi) and (vii) 
: i On, 
Mean 6n,? = x1, 75 (1 - a (1 — x) ia eaters (c). 


(d) Mean (6n,)? 6ny. Using the process of double summation as in (b) we 
have 


dN, Ny 
PRN ON nme i \ ee wases 
Mean (6n,)? 62, Mean (6n,) iN =e 
iy e i, on, 
= Na, X1% is ( = 7") (1 = *) from (c) 
fiji on, 
= DO =e G a =F) weve ccecceecccces see c er eccecscces .(d). 
(e) Mean (dn,)*.. As in (c) we have immediately 
f Tee ii, 
Mean (6n,)* = yi, (1 — =| (3x2 Ns (1 — 7) AF x1) duinclastaneae (e). 


(f) Mean (6n,)? (6ny)?. We again use the double summation as in (0), but in 
this case the algebra is much more troublesome. From the constants of the 
sub-hypergeometrical as given in (6) we have 

n, — dn, — 1 


VATA Nia N= s a ns ng 
EN OO) =(1-° M—m,—1 WET SO pees (ee 


* Since S(én,) must be zero, we can regard the Mean én, for a given dn, as being the result of a distri- 
bution of a deviate — 6n, distributed over all the divisions except the sth. The portion due to the 


Ns! 


s‘th is then Von.” (-6n,), as obtained above. 


ANDREW W. YouncG AND Kart PEARSON 219 


But this is the Mean (8’n,)?, where 5’n, is measured from the mean of n, in the 
case where 8n, is fixed and we must reduce to the general mean, i.e. where dn, is 
not given, to obtain the mean value of (ny)? for constant dn,. This is done by 
adding the square of the difference between these means of ny, namely Dore 
We thus obtain that the Mean (6n,)? for constant dn, 
N —ii, — 6n,— 1 H iene. Ty _ tg® (6n,)? 
= u Sia 1 ) cee) N — ii, ¢ N — =) ' (N — n,)? 
ty (1 x) ( an) Non, N? (dn,)? Mn, (N — ni, — Ty) 
= Ta tear Wins 7) == Oi (Sear | a 
M M/M(N—ji,) M?(N — v,) ye & (Non 1) 


— 7, 


nj? (8n,)? 


(NV -: Rs)?’ 


when we substitute Mn,/N for m,. 


Thus we have to evaluate 


Mean (Sn,)*8ne)®= {44 (1— 3p) Mean (Bn)#— (1 — 37) 4) Mam On 


mM M M/M N-—-i, 
N? Mean (6n,)*) M?n, (N — n, — ny) ny? ee 
— iP Wa ee eS A 2 , Mean (6n,)*. 


enw onj=n) “= s) 


Substituting from (a), (c) and (e), we find 


A (Gn BEN Ns) 


Mean (6n,)? (dn)? = x1 Msg i a 
ae Ne (= (N —7,) -1) 


( QN\ M(N ~2%,)(N-i—7y)_g _Te(N =~ Fy) 
Xe 9) + NO Vo. Re aN 
PS) (aioe 7 i pasa Say Vis 
N (F =) 1) Ny Ce 1) 
N — it, — fg 3 TiS . ny 
X4 iP OGY aye) A ee 
(W — 1) (5 (N — 2) 1) Oe yaa 


This expression must be symmetrical in s and s’ and this will be the case only 


if the quantities a = n,)— 1 and N—~7x, in the denominator cancel with 


factors in the numerator. By taking the two terms in y, together we get rid of 
the N — n, factor and after a Jaborious expansion in substituting the values of the 
x’s we reduce the whole expression to the comparatively simple form 


fi,jiy figt ily | 3iieily 
Xi {xsi (1 Nees Na ) + xs} Peer ee ee (i): 


This agrees with the value obtained by Isserlis from the differential equation to 


the hypergeometric series and thus confirms his result obtained by a totally 
different procedure*. 


* Proc. Roy. Soc. Vol. xctt. p. 28. Our notations are different. 
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With these formulae we can now proceed to discuss the mean value of ¢? and 
its variability. 


(4) Mean Value of 6? (continued). 
At the end of (2) we had arrived at the equation 


14 6 = 8(5) + Mem s (Sy), 


and substituting from (a) we now obtain 


Fe ) ol Ns ( RE x) ae 
1+¢=S can + Xi S v are peice (vili). 
We can usually put x, = 1 and write 
2 1 me? Ns lene 7a 
1+ ¢= (1 x) S (ae + Hy S & ee (ix). 
In the particular case of the contingency table 
= 1 Tawa 1 Nii 
oe ice Uv fae _ Uv 

Tone (1 a) s Ee tas =) a (x). 


Now S (7) - —1+¢,°, where ¢,? is the mean square contingency for the whole 
UU 


population, so that the mean value of ¢? as determined from a large number of 
samples is in excess of the true mean square contingency by 


vy (S(72") - 0 +443). 


(5) Standard Deviation of 42. Non-approximative Formulae. 


From the equations 


vg 9(H8) + F(R) = 3960") 


ae Wee il Ng Ns 
and l+¢ = 8(Fr) tx 84 (1-H 
én,)? NON, Ns Ns ; 
we have NS¢2 = 8S (‘ i. ) 258 ( 7 \- x15 iF (a — ny ioe vate (xi). 


Squaring, summing for a large number of samples and dividing by the number of 
samples, we have 


N2o?42 = Mean N? (6¢?)? 


=n (5 YY + 46H + xe OE O-9) 
15 (00). ($4) ays ffe(. 9.) 


bo 
Ww 
— 
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eee Is (ee is s sg (Gata) ie, s eZ <2 (8n,)? 


Xe re ds? 
aA, one (dn,)* 7, | , ((dn.)? NydN, 
+ 4s (Tego) +48 (Sur) +488 (“ar”) 


n,?(N — 7n,)? Rete (No = tts) (= Ts) 
— 88 (ia an e xP SS (755 N? ie 


where . denotes the same as S, i.e. summation for all values of s, and SS denotes 
ss 


Pnniition for all values of s’ except s = s’ followed by summation for all values 
Olt. 


Substituting from the equations (a), (5), (c), (d), (e), (f), we have 
i,(N — fi, i,(N — i, 
Beste (5, B= 


ere 
1 Ng Ne! Netny | 3n,ny 
#98 yo x yt (oN (1 +e) + xe)| 


n2 n,(N —7,) Mente Siang 
+48 {yax N 18.8 {ro Xa N 
Ns n,(N — n,) (N — 2%,) ne nny (N — 27,) 
sess i X1X2 ee) YA, X1X2 Nz 
n,2 (N — n,) nny (N — n,) (N — ny) 
axe 8 is oye | MSS Re ys | 
IP z EN 
i © ee | Xai oat (8x. Sree aa x —%) ee la (4 — (6x3 + 12x. — 2x1) n) ns? 


4 jie. 
Se (- ete (3x3 + 8x2 — x3) nN? s) ns] 
ra ee NeNe Dear s! 
+ S518 | (xe — xt Ht) Rae + Can — xa 2a) EEE) 
4 F Ne as © 
ot (- wt (8x3 + 8x2 — x1) wi) nen] 


Now it is evident that in numerical work the double summation would involve 
much extra labour, but we can get rid of it by using the identities 


(33) = 8 Gs)+ 8s Gan) 
s (i) s()-sG3)+8sG) ~$G3) +4885 
(s(8)) 363) +35 C882). 


and so reducing all to single summations. 


after expansion and rearrangement. 


* As this notation may be somewhat unusual, it may be better to make it clear by taking a case 
with three variates only, for example: 


(Sng)? = (Ny + Ne + Ng)? =Ny? + Ng? + Ng? + (My + Ng) Ny + (Mz +24) Ny + (Ny + Ng) Nz = Ped 248 S NsMs 
s ss 
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This leads to 


4 — (Xs is Bo) 9 & 


ean at + (3x3 + 8x2 — x1) mi) {8 Gals (il): 


(6) Standard Deviation of 4%. Approximate Formulae. 


The result of the preceding section is an exact one since we have neglected no 
terms in arriving at it, but as mentioned before we can usually take M to be very 
large compared with N and make xy, = xy. = x3 = xX, = 1. With this simplification 
equation (xii) becomes 


son [O)-(RF] 
+7 aus oi i) 48 (ax) 8 & ) = 10 {s Galt — 128 (mi) 
a E (5) ! \s Ge 2 Gs) | sk ielaiomes (xiii). 


In the great majority of cases it will be impossible to make rigorous use of this 
formula since we have no other knowledge of the whole population than what 
is given by the sample. In particular the 7’s are usually unknown and we must 
simply make use of the approximations at our disposal, namely the n’s of the 
observed sample. 

Again, it will usually happen that while 7, may be fairly large 7, — A, will be 
small and it will give formulae which are much more convenient for computation 
if we write 7, = 7, — A, and substitute #, + A, for 7, in equation (xiii), remembering 
that S (A,) = N. 


After some reduction the formula becomes 
9 4 p.? I 
SPN E (wi ) poe # 
i 72 [68 (5) + +0 — 443) (¥) 1: 28 (77 .) + (2—442)¢—22—566?— 109 | 


+ LSC) 9G) = eas) +20 —a9 (0) roomy 


ahaa eae (xiv), 


where c is the number of classes or categories in the population in question. 
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(7) First Application. Contingency. 


As mentioned in (1), if we regard the sth division as the (u, v) cell of a contin- 
gency-table and if we take 


My My — NyNy 
M2. ONS? 
m2 
1+¢=8(**), 
and (n -- are) 
ges (ean 


lis Mean square contingency. 


= 


then 


Accordingly, with the notation 


Ni Uy 


A,= > 


and 


[(n - nate)'| 

be? OA cama Say | 

Oe a laa) 

the equation (xiv) gives the standard deviation of the mean square contingency, 
when MM is very large as compared with NV. 

The terms enclosed in the first bracket of (xiv) are exactly those of Pearson’s 
1914 paper in Biometrika, so that the second and third brackets contain the terms 
arising from the squares and higher products of 6n,. 

Of the total correction due to the higher approximation it is of interest to find 
how much is due to the change of mean and consequently the change of origin of 
d¢?, when the square of dn, is not neglected. The true mean is given by 


— 1 Noe 1 n n 
i 2 s I s Ss 
1+ = 7 S() + HS {EC nyt 


and using the observed values of n, as the best approximation available for 7, 
= i Lg fis + As) 
Ze 2 | vs Ss 
1+ $= (1+9%)(1-y) + 75 a 
L fo (tt 
= { 2) 4 s 2 pe. 
(1+ $4) + 5 48 ($4) POL it, 


so that the difference between the true mean and the approximate mean obtained 
by neglecting squares of dn, is 


gt [8(f) a be Ip. 


In accordance, then, with the formula for change of second moment with change 
of origin we get the effect of the change of mean on o%4: by subtracting 


wa 18(s) s+ e—1P 


from the approximate value. 
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In the examples given below it will be seen that this is only a small part of the 
total correction and thus the main part of the correction is due to the retention 
of the squares and products in the value of (642)? used in (5). 


(8) Numerical [llustrations. 


I. Contingency between Handwriting and Intelligence in Girls. 


The probable errors of the contingency constants in this table have been worked 
out both in the 1905 and in the 1914 papers and below is given a table showing the 
effect of the corrective terms of the present discussion. 


The new summations required are found to be 


: (%) es iS () -= 15982, 
(2) _ 93.1. NY) _ 7. 
S ee = 93 144, S ( a ae 57270, 


and in the following equation the numerical values of the various terms of equa- 
tion (xiv) are ae in the same order as their corresponding algebraic terms: 


a, = oa [-14865 + -09580 — -00918] 
4 aay [558-864 + 97-404 — 1-784 + 58-205 — 22 — 5-365 — -092] 
os amr [57270 + 15982 + 164 — 186 + 870 + 1224]. 


The other calculations are summarised in the table below: 


TABLE I. 


Ce 5 oy) 
$? = 09580, O, = = / RR 2957. 


Various formulae used 


Blakeman and Ist Term of Ist and 2nd le Adena ster 
| Pearson | (xiv) or Pearson +) | (<i 
(1905) (1914) | Terms of (xiv) (xiv) 
| Tyr ++ tee oo -02023 02286 -02709 | -02729 
| Probable error of ¢? | 01365* | 01542 01827 -01841 

(a, abo fe Ra -0285 03219 “03815 03844 
Probable error of C ‘0192 02171 -02573 02593 


In this table the work has been carried out to four significant figures with a view 
to showing the corrective effects of the various terms. It is apparent that the 
fineness of approximation given by (xiv) in full is more than is required in practice, 


* Incorrectly given as -0042 in Blakeman and Pearson’s paper, loc. cit. footnote p. 196. 
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but there is a considerable difference between the values given according as the 
second term is used or not and it seems that in some cases it would be advisable 
to calculate this term or at least the most important terms in it, viz. 


Ne a (58 (5) + 8S (¥:) + )+ 2c) ors ak aac Re ae ent (xv). 


Using this approximation in the above case, we obtain og: = -02730, a result 
which—by a mere chance, of course—is almost exactly that given by the full 
expression in (xiv). 

II. Contingency between the Hair-colours of Pairs of Female Cousins. 

In the example just given the total number N in the sample was fairly large, 
viz. 1801, and it might be expected that in smaller samples the corrective terms 
would be of increased importance. With a view to testing this a contingency- 
table given by Miss Elderton in her Memoir on “The Measure of the Resemblance 
of First Cousins”* was selected. There are 36 cells in this table and the total 
number in the sample is only 218, there being several cells with zero or very small 
content. 

The Table is given in full on p. 226 along with the quantities required for the 
calculation of 7 and oy; it is there evident from the figures how large a proportion 
of the variability depends on the cells of small content. This is of course to be 
expected but the importance of having large numbers in all the cells is not always 
appreciated. In this particular case physiological reasons would prevent us 
from clubbing together the “Fairs” and “Reds” and with the fewness of the 
observations at our disposal we must use the table simply as it stands. 

The scheme followed in each cell of the table is shown in the last column, 
and in the marginal tota!s are given the values of all the summations required 
for (xiv). These are 


2 
gee Sie ) = -14895, iS (52) 5 536:8. 


NA, A,? 
2 

s(#)=—1sis1, 8 \(*) | = 19-162, 
se 

Ss (+) = 5170-9, S (755) = O82 (12 


When these are substituted in eauaoOn (xiv), we have, preserving the algebraic 
order as ae 


C23 — r [08277 + -14895 — -02218] 
=F oe 5 Lda: -9714 — 13-6837 — -9932 + 50-5515 — 22 — 8-3411 — -2219] 
TES ee oaort 5170: : 3 195-7 294) 
+ (gygys [— 23368 + 5ITO-9 + 3-4 — 38-3 + 125-7 + 1224), 


and the whole work is again summarised in Table IIT (p. 227): 
* Hugenics Laboratory Memoirs, tv. Cambridge University Press 1907. 
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Second Female Cousin 
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TABLE II. Contingency between the Hair-colours of Female Cousins 


First Female Cousin 


Tints very, ee Brown! meee Fair Red Totals 
9 11 3°5 11 1°5 O 36 =Sn, 
5-945 | 9-248 6-523 9-495 3-963 0-826 Pe (Ag) 
36-67 23-57 33-42 22-96 55-00 263-93 435-55 =S(N/\;) 
Very -5139 1894 — -4634 “1585 — +6214 — 1-0000 —1-2230 =S (y,/A,) 
Dark 2641 0359 2147 0251 3861 1-0000 1:9259  =S (W2/Ae 
18-84 4-46 ~ 15-49 3-64 —~ 34-18 — 263-93 — 286-66 =8 (Nw,/A,2) 
00720 00152 00642 00109 -00702 00379 02705. =S (W2/NA,) 
00370 -00029 --00298 00017 — 00436 —-00379 —-00697  =S (W.3/NA,?) 
11 13 12 10 9 1 56 
9-248 14-385 10-147 14-771 6-165 1-284 — 
23-57 15:16 | 21-48 14-76 35:36 169-78 280-11 
Dark 1894 — -0963 -1826 — +3230 -4598 —-2212 1913 
Brown 0359 0093 0333 1043 2114 0489 4431 
4-46 —~1-46 3-92 -4:77 16-26 — 37-56 — 19-15 
00152 00061 “00155 00707 -00598 00029 -01702 
00029 — -00006 00028 — -00228 00275 — -00006 00092 
3°5 12 8 9°75 3°25 3 39°5 
6-523 10-147 7-157 10-419 4-349 0-906 — 
33-42 21-48 30°45 20-92 50-12 240-61 397-00 
Brown — -4634 1826 -1178 — -0642 — +2527 2-3111 1-8312 
“2147 -0333 -0139 0041 -0639 53412 5-6711 
— 15-49 3-92 3-59 1-34 — 12-67 556-07 534-08 
00642 00155 -00046 00020 00127 | -02220 03210 
— -00298 00028 00005 —-00001 — 00032 05131 04833 
11 10 9°75 16°5 9°25 1 57°5 
9-495 14-771 10-419 15-166 6-330 1:319 = 
22-96 14-76 20-92 14:37 34-44 165-29 272-74 
| Light “1585 ~ +3230 —-0642 -0880 4614 — +2419 -0788 
| Brown -0251 -1043 -0041 -0077 +2129 -0585 “4126 
3-64 —4-77 — 1-34 1-26 15-89 — 39-98 — 25:30 
-00109 -00707 -00020 00054 -00618 -00035 01543 
| 00017 — -00228 —-00001 00005 00285 — -00009 00069 
1:5 9 3°25 9°25 1 (0) 24 
3-963 6-165 4-349 6-330 2-642 0-550 — 
55-00 35-36 50-12 34-44 82-51 396-38 653-81 
— +6214 -4598 — +2527 -4614 —-6215 — 1-0000 ~ 1.5744 
Fair 3861 2114 0639 2129 3863 1-0000 2-2606 
— 34-18 16-26 — 12-67 15-89 —51-28 — 396-38 — 462-36 
00702 00598 00127 00618 -00468 00252 02766 
—-00436 | 00275 — -00032 00285 — -00291 — -00252 — 00452 
ft) 1 3 1 ft) | 0 5 
0-826 1-284 0-906 1-319 0-550 0-115 — 
263-93 169-78 240-61 165-29 396-38 1895-66 3131-65 
Red — 1-0000 — +2212 23111 — +2419 — 1-0000 — 1-0000 — 1-1520 
1-0000 0489 5:3412 0585 1-0000 1-0000 8-4486 
— 263-93 — 37-56 556-07 — 39-98 ~396-38 —~ 1895-66 — 2077-44 
00379 -00029 -02220 00035 00252 -00053 02968 
—-00379 ~ -00006 05131 — -00009 — -00252 — 00053 04431 
Totals Marginal Totals are the same as for vertical margin a = ae 


S(N/d,) =51709; 


S (Ws/As) =x 


— 1-8481; 


S (5?/As?) =19 


162; 


3 (Nypa[Ax2) = — 23368 ; 


S (q,3/VA,2) =-08277. 
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TABLE III. 
; 7 fp? apes 
db? = -14895, C= Tee a -36005. 


Ist Term of Ist and 2nd Terms All Terms of 
equation (xiv) | of equation (xiv) equation (xiv) 


Os aoe Te 20 062006 079848 082317 
Probable error of ? “041822 053857 | *055522 
Oi ode ele Ar -16902 | *21765 *22438 
Probable error of C | -11400 -14680 *15134 


The relative importance of the three terms of equation (xiv) is not markedly 
different in this table of small total content from what it was in the case of the 
Handwriting-Intelligence table and we cannot base different conclusions on the 
two cases. 


Again, using the approximation given by selecting the large terms from the 
second bracket of (xiv), viz. 
ips” bs\ . « 
Se) 
we obtain Oy: = 0864, 


which as in the previous example is a reasonable approximation to the full expression 
result. 


(9) Second Application. Test for Zero Contingency. 

Suppose that we may expect in the sampled population an absence of contin- 
gency or correlation between the variates considered. In other words we will 
suppose 

nyN es 
i a ree 


If, however, we take a sample from that population, the quantity 


: (n, — ,)” 
g=s are 
which is the mean square contingency in the case of a population with zero 
correlation, would certainly not vanish. The problem then arises: How great 
may the quantity ¢? be without making it highly improbable that the sample 
in question is really a sample from a population of uncorrelated material ? 
First of all, the mean value of 4? as determined from a large number of samples 
would be 
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as is obtained by substitution of A, = 7; in (viii), or, if x, = 1, 


p= yle—V). 


In the same way we derive from equation (xiii) 
OPC (Ce ne 
ot, = Ix (a+ = ’) Lope (hes My eel = 1} oe (xvii), 


where H is the harmonic mean of the mean cell contents, or for the usual particular 
case when M is very large as compared with NV 


F I ic cle 2a ie 
o's = HB if += N + 2(e- 1) weaned (xvi), 


. . . be FF l 1 
an expression which is very easily calculated especially as i will usually be small 
compared with c and hence a good rough approximation for a fairly large table . 
will be got from 


ee i i i Cr ero 


Thus if we take twice the standard deviation as a limit to the probability of 
deviation being that of a random sample, we have as a rough upper limit to the 
value, which 4? may be expected to take in any sample, 


(10) Numerical Illustration. 


In the example of the Contingency-table for Handwriting and Intelligence 
in Girls 


p? = = (c — 1) = 01943, 


and when calculated from the more exact formula (xviil) 
Og: = °004879, 
the approximation given by (xix) being -0046. 


Hence in accordance with our assertion above, we should regard any observed 
value of 6? which exceeds -01943 + 2 x -00488, i.e. -02919 or, say, -03, as being 
incompatible with zero contingency. The observed value of ¢? = -0958. 


The corresponding mean value of C—the coefficient of contingency—is 
-01943 : 
RORY) 
and the upper limit for C according to our assertion is -1684 or, say, -17. The 
observed value of C is -2957. Clearly there is definite association between 
intelligence and handwriting. 
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(11) Summary of Formulae. 


It will be convenient for purposes of reference to have all the formulae collected 
into one section. 


General Formulae. 
For ¢? defined by 


re) | 


: gens (ON 


where N is the number in a sample, n, is the number in the sth division of that 
sample and A, is a number connected with the sth division satisfying the condition 
S (A,) = S (n,) = N, we have proved that for an “infinite” sampled population 


1+ @=(1 x) SH) + SG) Peri. (B)*, 


68 (¥5) + (8 — 44%) 8 () = 128( )+ (2-462) 22— 56% 108" 


+ wleCh)+s)*fo(Q)f -as(ft) 26a 9(f) v<e—2) 


Ss 


Fi. Sen eee 5 
where ¢ is the number of divisions or cells and #, = 7,-A,, or, with a fair amount 
of approximation, 


ay =o E ( We) +¢2— [+ - ac (Ss s) cas (2) + +26] ...(D). 


Contingency. 


In the case of a contingency-table the sth division may be taken to be the 
(u, v) cell and A, = nits , Where n,, and n, are the marginal totals of the wth row and 
the vth column. Formulae (A), (B), (C), (D) are then directly applicable. 

Test for zero contingency. 


When there is zero contingency in the total population 


and (B) reduces to 


* As usual the bar over a letter denotes “the mean value of.” It is to be noted that usually there 
is only one sample and the value of n, in that sample has to be taken as 7,. 
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and (C) to 


~ N21 
where c is the number of cells in the table and H is the harmonic mean of the cell 
contents. 


ae \" +1) 4 2% oh a ae (0, 


Rough approximations in the case of zero contingency are given by 


- c 
Den pee 
N? 
and oy: = a ) 


and from these we can derive as a rough upper limit to the value of ¢? given by a 
random sample from a population of zero contingency 


Coen dee ae (1 | a 
T x/¢ . 


ON SOME NOVEL PROPERTIES OF PARTIAL AND 
MULTIPLE CORRELATION COEFFICIENTS IN A 
UNIVERSE OF MANIFOLD CHARACTERISTICS. 


By KARL PEARSON, F.R.S. 


(1) Let the universe consist of V individuals each the bearer of n characteristics 
symbolised by the numbers 1, 2, 3, ... s, s’, s’’, ...m respectively. Let 7,y be the 
correlation coefficient of the s and s’ characteristics, and let the whole system of 
total correlations be provided by the determinant A, where 


Te AFL 8 a en aR lee eer (aye 
Hoek wees Tos Ton 
Vs1> ls "ss lsn 
Dems ic. Mngancene Ll 


This determinant being symmetrical because 7.7 = ry. 


We shall use A,, for the first minor corresponding to the constituent in the 
sth row and s’th column, and A,,,”,” for the second minor corresponding to the 
first minor of A,, which is associated with the constituent in the s’’th row and 
sth column. 


Then if R,.493...(s)...n Genote the multiple correlation coefficient of the sth 
characteristic on the other n—1 characteristics, i.e. the n without s, and 
ss'P123.... (5)... (s’)...n Genote the partial correlation coefficient of the sth and s’th 
characteristics for the remaining n—2 characteristics constant, the following 
results are fundamental and well-known: 


A 
R S, 123i2 (S) aeons 1 = ie eieiekelaieicrecersietaltioretatstareieyevsistelevsre alate (11) 
$s’ P1293... (3). (s’) em = ee Bo aise ce Gitar cee ctee es (111). 
VG NG 


To abbreviate the subscripts we shall write these 
te and ss'Pi-n> 


but where others of the variates are to be left out of account we shall be obliged 
to introduce the bracket system to mark partial or multiple correlation coefficients 


232 Novel Properties of Partial and Multiple Correlation 


of lower orders. Thus R,.4 (9) -(—-n or simply B,.4 (yn, would signify 
the multiple correlation coefficient of the sth characteristic for n — 3 other charac- 
teristics, s’’, s’’” and of course s being excluded. Similarly ..p,_(e"y”siv)-n Would 
be the partial correlation of the sth and s’th characteristics for n—5 other 
characteristics supposed constant, i.e. the n variates with the exception of 
Sis, S's Sc wand sie 
(2) Preliminary Propositions. To express First Minors in Terms of 
Second Minors. 


While values of the second minors in terms of the first have long been known, 
i.e. the equations of the form 


ae Ay, Aye = Age Ager 


Agse's" a A tee cee ese er ccs wns vceveviecio (v), 


the reversed results, i.e. the expressions for first minors in terms of second, seem 
to me less familiar, at any rate I have not come across them in the literature with 
which I am acquainted. 


The forms to be demonstrated are 


A2 ' 
Me vw (A geste Negeiig Ne eee eet eee eee (vi), 
sss . 
x2 “3 
Ags” = v.. (Ageia Neagtes ae Niigcgt Nevergs\ ccc een eee nee (vil), 
sss 
where Vaya =| Dyes’ Ay | Nepal Seeccn ee ee eee (vill), 


1 } 
Ae ? Ags ? Ags 
A 
and V2 ge = AP) 2 Ai agg oe — Aree 8 = Neer eee (ix). 


sss's”’ > rE, | 


| 
ae 
b 


To prove these results I start from (iv) and (v) to express the value of 
Ags's’s’ Asss's’ ain Agsss Ag” SaSSu° 


s 
We find 
Ay Agree = Aegon NG Ne nena NEE Nee, 
Ag ss Agss's" or Ages” Ag's''ss! a ( ai: NEES vs") x “ eS Sa) 
A A 
a (Aye Ays = Ags Ags) ~ (Aye Aggy — Ay Ay) 
A A 

a= Bs (A Nyy Api Aggie = yy Atg = Ay Atiy OA 

i me ssAs's/ Ags” — Ags A* gn — Ags AX gs — Agr yr AX gy + 20g Ags ss!) 

iy Ags" 3 V 56's" 

= A ; 


which gives us at once (vil). 
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Similarly we have by (iv) and (v) 
Mee Agss’s" ae A? s56's" 
_ As: Ayy — AY sy 
a A 


2 
o 
SSi 


A (As; Ags a ING; Ager)? 
s Ae sSSt/, 


A 
ee Ae (Ag, Aggy Ags is Ags A?,4” — Ag giNeet. a Agi A? sy Si 2A gs Ags Ase) 


x AssAgrer — A 
A 


Ne 
~ A? 


which gives at once the result (vi). 


V 5's" , 


We shall now substitute values like (vi) and (vii) in our definition (vill) of 
V.s's7 and so deduce (ix). 


We have 


—A6 


ve. = ———. 
-: (Vies's")8 


(eee sel 

i} 
= Noho 
Nasal! Ag's'ss! 


( 


Nese’ Ag's'ss! 
ar Neate! Neate 


NX 


}; 


” 
s 


a 


Aggs's" Ags'ss" 
Nvaatst Ag's's's 
Ngs's"'s" Agss‘s” 


)| 


( ) fee =e) ( 
= eveats ‘ ar Agis'ss" Ag's'ss! 


Nea Nee Nios Nore 
3! 
Noe 


gic A gies” Ag’s’’ss! 
but the determinant on the right is the square of the determinant 
IRN Neri ee rN oie |e 
| | 
ii Ag's'se'3 Agy's''ss5 


= DNgesis | 
| 
a Ags's’s» rae ANY Seo iNeey 


) | 


el Agss’s! Ags's"s 
Aggs's" Ng 
Ar eee Agrs's's 


whence by taking the square root we have the result (ix). 


(3) Application to Multiple and Partial Correlation Coefficients. 


Equation (viii) may be put into the form 


IN 
Vss's" = A,sAgy Ags” I inca =) 
AG Age 


iNet Wo 


Waa 7 ’ ds 
ING Ags” V Age Ags” | 
= ; A? : 1p — ss'Pi-—n»  — ss"Pi-n | 
(1 =e se1—n) (1 me Ry, i-n) (1 ac Ry, 1-n) — ss'P1—n> i. s's’Pi—n | 
| — ss"Pi-n>» — s's’Pi—n> 1 | 


Pena ateee (x). 
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Turning to the expression (ix) we may write it 
; BUA Rae us ene 
V2 06" = A’ Agets’s” As's'ss Asse's | 1, ee ? ia Sate, 
| VA, Sissi Ag's’ss VN page Agss's! 
pee ergs 1 Tyee Agso's" 
V Age's's’ Agrs'ss V Agrs'ss Agss’s’ 
aa Aves's 's Jail Assia’ ] 
| 9/ Neen s”’ Asss's' V Ayrgrss Nses's! 


The determinant on the aa can be expressed in terms of partial correlation 
coefficients of one lower order, 1.e. it equals 


My ss'P1—(s")—n> — ss"P1—(s')—n 
ss'P1—(s")—n 1b s's"P1-(s)—n 
r h ss/Pi-\s))—n> — s's"P1—(s)—n> 1 
urther: 
INSIANGipaui/N ce Ne NG Ae AVE Ayre’ Agss's' Ag's'ss Nes 
s’s's''s" As’s!'ss Asss's A A A Nee Non ING 
Also: NG Ass Agy Ayr! Aggy Sif Age's Asce's 
A A Ags" Ass Ags 
a oAGe ! 
(1 saelte s.1- n) (1 = Ry ap) (1 az Ry ton) 
S 1 
2 \ = . 
Th (1 —R s.1-(s")—n) (1 es Reet =n) (1 as I Ree ele =) 
us: 
(1 aie Gralla) =) (dies of: 1=(s)— 5) (1 os Ry 1-Ws)—n) 


: — = (1 Fe re (1 — T= Gen) (1 sag R s eBlee) 
This is a relation between sets of three multiple correlation coefficients of the 


(n — 2)th order. 
Buu ee NG I 
ME mC (=o) i= mal ona) 
1 
= Btn) By awn) 0 — yan) 
<i ay is'P1=@—no -ss"Pr=(a =n) esate eee eee (xii) 
ss'P1-(s")—n> I, s's"P1—(s)—n 
ss'Pi-(s)—n> __s's"P1-(s)—n> 1 
Squaring (x) and combining with (xii) we find 
I, — s'Pi-n» — ss’Pi-n |? 
— ss'P1i-n> Ms _- — s's"P1-n 
— ss"Pi-n> — s‘s"Pi-n> 1 
(1 — B®, 1-2) (Ll — Ry gn) (1 = By 3) 
1, ss'P1-(s"l—n» — ss"P1—(s)—n | 
ss'P1—(s")—n> 1, s's"P1-(s)—n | 
arin ae an 1 | is 
* U2 Picea = Bec aC eee 
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The denominator on the right-hand side of (xiii) can be replaced by the left-hand side 
of (xi), or by the square root of the product of both sides of (xi). This is a relation 
between partial correlation coefficients of the (n — 2)th order with multiple coefti- 
cients of the (n—1)th order and partial correlation coefficients of the (n— 3)th 
order with me coefficients of the (n — 2)th order. In the particular case of 
three variates 1, 2, 3 with total correlations 7,5, 723, 7; (xiii) reduces to 


1, SeaePan aspen (Ets) R*,. 31) de St 12) | 1, 42, Tis 
Sonas I, si (US 97755) (1 = 973, = 77) | for, 1, Toe | 
— 31P2> — 32P1> yal | 31, 132, 1 

Poscsnaeice (xiv), 


a result which might possibly be of practical value in testing the accuracy of 
the determination of the first order partial and multiple correlation coefficients. 


I now return to equations (vi) and (vii), and I divide (vii) by the square root 
of the product of Ay, and Ay, obtained by cyclical interchange from (vi). 
We find 


Nea Neer hha RENE aA Oe ey : 
V Avy Mgrs’ V Agere Ngs'ss oe A? esis V Agre'ss Agistss a A? or 93s! 
Asse's" 3” Ags sss” ok gress! 


+ : 
_ VA yy esAs's's ’s'ss VE ss Negis's''s’ Sa — Sh reg Nei e's!s! 
1 Byles! i Vie x ae ‘sss’ 
if ee Ags'ss Agr sss 


or, changing sign throughout, 


a 


Pipes aes Bel oni e Pianist Pi wen ae 

(tis V1 ae Tea | ay ss'P'1—(s")—n 
(xv) is the familiar result for obtaining a partial correlation coefficient of the 
(n — 2)th order from those of the (n—3)th order, but the proofs usually given of 
it seem to be based on some appeal to general analogy rather than to the definite 
algebraical form of the coefficients concerned. It was, indeed, a lecture proof of 
the relation (xv) from the determinantal forms of the coefficients which led me to 
the results (vi) and (vii), as apparently novel determinantal relations. 


We can next consider results (vi) and (vii) individually. From (vi) we find 


Ns A8 be y Ag Se Naas? NY 
Cae) 


A Vies's” A A Ags Ags” sss’! Aeesi's” 
or, converting into partials and multiples, and writing 


€ 

Ces = lis — ss'P1—n> — ss"Pi-n |» 
— ss'P1—n> A — ¢'s"P1—n 
— ss"Pi-n> — s'sPi—n> il 
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we have 
ees ie mes (1 se eee) ( a i eters) (1 zz Ry, np) has 1 
Leh 1 P,,-2 (1 — By an) (1 — Bey an) 
Gee —— =n. OS Lae Serreltajerh) + 
(1 —- I erage) (J = RB, s()—n) ( eras 
Hence 


(1 = Le elie) e er Iter aCe) ee. 
V= ¥e/P 1-0 
This is an expression for a multiple coefficient of the (n — 1)th order in terms of 
multiple coefficients of the (n — 2)th order and partial coefficients of the (n— 2)th 
and (n — 3)th orders. 
Equation (xvi) just obtained may be put into another form by aid of the 
relations * 


(Be in)? 


2 
1 2 ae if ane Rk s.1-—7” 
— ss'P1-n 1 R2 ’ 
ini S.1-(s’)—n 
2 
| — “pp? = 1-k S-1-n _ 
Ss 1-7 1 an Ree iy a 


leading to 


. (i Ae 
bP 1 Le op 4n) * ; 
( ss'P”1—n) ( P1-n) (Re) ae 
oP a : 
= by (xvi). 
i espera en y ( ) 
Hence yar a ee eee (xvi)Pis, 


et ics ss'P'1-n) (1 = ss"P1-n) 

Thus (xvi)bis is the reverse of (xv), giving a partial correlation of the (n — 3)th 
order in terms of those of the (m — 2)th order. For example, if there be three 
variates, 1} 2, 3, 
eye 1 — s2P1 — 13P2" — 21P3? + 239P1 + 13P2 21P3 ; 

(1 — yops”) (1 — 13/2") 
a 32P1 a 12P3+13P2 
i 12p3°V 1 — 13P2" 
which can be easily verified by substitution of the values of the partial correla- 
tions, or be seen at once from the polar triangle. 


leading to 


For the particular case of three variates we may use (xvi) instead of (xvi)>Is, 

writing it 
(1 — R2,, 55)? = (1 oe: zu (I — 7713) | I, —2P3, — 13P2 |---(XVi), 
— Tas — 12P3> 1, — 23P1 
Ge 13220 en emacs 1 
which will be found on substitution of the values of the partial coefficients on the 
right and the multiple on the left to reduce to the familiar result that the square 
of a3 by 3 determinant is equal to the determinant formed by its minors. In this 
case of three variates, if 73, 731, 742 be taken as the cosines of the sides of a spherical 
* R. S. Proc. A, Vol. xct. p. 496. 
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triangle, then 43p;, 512, wes are the cosines of the angles, and Ry,5,, Ry.o3, Rs... 
are the cosines of the perpendiculars from the angles on the opposite sides. If 
a, b, c are the sides, A, B, C the opposite angles, pa, pb, pe the perpendiculars on 
the sides, then the above relation is 
ne, sin?b sin?¢ i —cosC, —cos B |. 
6 = 
sin* @ — cos C, 1 — cos A 


—cos B, —cos A, 1 


It is greatly to be desired that the “trigonometry” of higher dimensioned plane 
space should be fully worked out, for all our relations between multiple correlation 
and partial correlation coefficients of n variates are properties of the “angles,” 
“edges” and “‘ perpendiculars”’ of sphero-polyhedra in multiple space. It would 
be a fine task for an adequately equipped pure mathematician to write a treatise 

n ‘spherical polyhedrometry”’; he need not fear that his results would be without 
practical application for they embrace the whole range of problems from anatomy 
to medicine and from medicine to sociology and ultimately to the doctrine of 
evolution. 


Lastly we may turn to (vii), and express it also in multiple and partial correla- 
tion coefficients. We have 


Ags’ As ss's'9" ai Ne Aue Ay 
ay NG) | Voss" Fae Tee AA 
Neste! | ee 33” Agr sss 
«(7 MINE ENG Any Aga As —) ; 
or using the correlation symbols 
is’Pi-a = (1 = Re en) (1 = ay -1- ) (1 saa ay pn) : 1 : 
Pe 2 v(1 SUA ery ley tere re Na) 
] 1 


ee ———— : = : = 
v(1 ae R?, 4s) —n) (1 = te s.1—(s")— S.A iG Ry. ie aN (1 - ae, Ry s.1- a) 


x (s/s"/P1—(s) —n — ss"P1-(s—n * se Peeh =n): 


Or, 
s's/Pi—n — (1 and Teen) 
a of (1 oe. Reon) (1 a) ee sI-n ) Fe 
(1 oa eee ney) (1 as A ae) (1 noe Ry -1—(s)—n a= = 26 o1-(s” =) 
Ng ee elton SIE a col (xviil) 
n-2 


This is a complicated form and unlikely to be of material service. 
If we use the symbol ‘P,,, to represent the determinant 
15 ss'P1—(s")—n> — ss"Pi1—(s’))—n | 9 
ssP1-(s—n> Us s‘s"P1—(s)—n | 
| 
ss"P1—(s')—n> — s's"P1- (s)—n> if 
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we can throw (xviii) into the form 


eee as ey iD Py 
ia Tee, s_.1—-(s)—n 
sPr1i-n SOS (1 =: Yi Nena, >) (1 i Ry 1 (s)—n) 


gel Pi—l an SRI) as Pe ( 
VabEos 
Here on the right we have only partial correlations of the order n — 3, but the 

expression involves a multiple of the order n — 1 as well as those of order n — 2. 
Of course in the second radical s” and s’ may be interchanged owing to the existence 
of (xi). On the whole it appears best to confine our attention to (xv) and (xvi) 
as the fittest representatives of (vi) and (vii) expressed in correlation forms. (xv) 
has long been in use either to determine the partial correlation coefficients of 
higher orders by repetitional processes, or better to verify results given by (iii). 
The calculation of the higher multiple correlation coefficients by (ii) can be verified 
by the aid of (xvi) if the partials have also been found. But if once the value of 
A, has been determined the continuous product formula may often be advan- 
tageously used. Looked at from the determinantal standpoint this may be rea’ hed 
as follows: ; 
iN LNs TANG Asss’s” Assests"'s! AV to n— 1 terms 

WAG S Aa oN Neen meee 1 
= (1 a een) (1 aie I series ry) (1 a RP 1 -(68)—n) (1 ii Ry 4—(ee's)—n) m0.6 (l—7? pq), 

if the pth and yth variates be the last to be excluded. 
Hence* 


2 
eis s.1—n 


x 


ANS 
(1 . Bee ee) (1 an Jin Fe ers) (1 = I Meh e ata») re (1 ie rq) 


The trouble of this method is that A, has to be calculated at each stage, if 
we deduce R,,_, by a repetitional process. If we use it merely as a verification 
process for (ii) we shall not verify A,, unless it is worked out twice. Of course 
A,,, 1f n be at all large, is more troublesome to calculate than the 3 x 3 determinants 
of formula (xvi). 


However the primary object of the present paper is not so much to provide 
verification formulae as to show by direct determinantal analysis certain relations 
known and unknown between the higher multiple and partial correlation coefficients. 


* This must not be confused with the formula 
1 —..-p? a a UW Si esp 
see (1 — g5"'P*1(5')—-n) (1 — 35'"P*4-(s's"")—n) ql — g5''P*1—(s's"'s"")—n) sinie (1 —7° pq) ‘ 
see Yule, R. S. Proc. A, Vol. LXxtx. p. 189, and Pearson, R. 8. Proc. A, Vol. xct. p. 496. 


ON THE APPLICATION OF “GOODNESS OF FIT” TABLES 
TO TEST REGRESSION CURVES AND THEORETICAL 
CURVES USED TO DESCRIBE OBSERVATIONAL OR 
EXPERIMENTAL DATA. 


By KARL PEARSON, F.R.S. 


Let us suppose that a sample of size N with class groups nq, 1s taken out of an 
indefinitely large population of size M with class groups v,,, these classes being 
arranged according to two variates x and y. Then the mean of any array of a’s 
for a given range of y variates connoted by the centre, y,, of this (usually small) 
range will be 


Here n,,, the number in the «,, 7, class, and n,, the total number in the pth array 
of a’s, will vary from sample to sample. But 2, and y, will remain of course the 
same. Now let m, be the mean value of m, found from a large number X of 
samples and let us measure m, = m, + dm, from m, and n,, from vi,, = Nv,,/M, 
and m, from Nv,/M, or take mq = Np7+ O%yg, and n»=7,+6n,. Here the 
differentials are statistical differences and do not at present denote that we are 
going to neglect their higher powers. From (i) we have 


Mm, + 8m, = s (Per) on, (’ S (y+ - 
Np ie nie Np Np 
8n, 2, én 6n,\2 dn,\3 
g up”a Dp | D Baas) (ec 2 
+8( ii yi a eS Ge cae 


Now we shall sum this for all A samples (dividing by A) and suppose that third 


bn én 2 ; 
order powers and products of —* and ee are negligible as compared with lower 
n 
Dp Dp 
order powers and products*. If & denote a summation for all A samples 


X(6m,) Us (8m%_) _ -(6Ny) 
NO re, ee | 
since all these quantities are measured from their mean values. Thus we find 


se sae ee 
Tans 72”) (1 te ~ ene) Bg ( % % Pra Pte) ; 


An,” es \ 


0, 


p 


* Actually terms of the third order also vanish. I have not investigated whether this be true for 
terms of the fourth and higher orders. 
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2 ji 3S i 
But * = = ils (1 * , = ee Tees ( _ 7") 4. eee (il). 


and accordingly 


= mean of the array in the sampled population. 


Thus to a high order of approximation at least the mean of the array means is 
the mean of the corresponding array in the sampled population. This result 
cannot be taken as obvious, as the size of the array in the sample variesf. 


We can now write to a second approximation: 


ye eg Se) eG 2) 


Nis ne n Ny 


Steyn) ie 


N» ne 


= S {ONqp (%q — My)} S {0NqpON py (%q— My)} 
a =—t 


2 
5 Th 


since Vp 1S) (OE 


We shall now find the mean value of (dm,)? as far as third order terms. Let O, 
and QO, give the second and third order terms; let & denote a summation for all 
samples and A their number; write %, for z,—m,. Then 


LE {S (Bryn) 


oa Tae 
el S {2X (SN)? H7}__ 2S 1D (Sg phNg"y) Hy Hy} 
r iy? “2 ni,” 
>> ; 2 q = 
But al = iin (1 _ ¥)| 
sip'sledeise oot Oca nee Eee iv). 
>> (dn, pONg"'p) Ng! yNal'n (tv) 
A N 
Thus Os = s era S 2 ae! 28; (7% a iq’) 
p p Np 
ee AEs 1 S (NapXq))? 
iy ON ons 
=F simoc 8 (ip) Oc ie oo) ere (v) 
p 


* Biometrika, Vol. 1x. p. 2. 

} The assumption that the mean value of a character in a number of samples is the value in the 
sampled population is often made, but is nearly as often erroneous. Thus the mean value of the 
correlation of a character in samples is not the correlation of these characters in the sampled population. 
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Hence to this order we have om, = o7,/ Vi», precisely the value of the standard 
deviation of the mean on the hypothesis that the array is of constant size equal 
to the mean size 7,. We next turn to the third order terms O;. Here 

2 > S (6% p%.) x S Crettet al 
A nee 
9 SZ (ON pdMpHq?) 4 SU (ONg'pPNq'pOM py Ly’ Xq"’) 
Ar, ae Any? 


But the summations marked by = can be expressed by aid of (e) and (f) on 


p. 244 below. We have 
) at 28 {fare (1 *) ite 
2 


eae 2 = 2Nap Ny 
Oy 7/5 lip (1 i )( N 
nN n 2n 


Owe 


Ons Je n ~ 
=— i, = - i) , since S (n,»%,) = 0 
According] 2 oni, 2 Np 
rdingly OO a ae T-= a | (vi). 
p D 


The corrective term OQ; is, I think, of marked importance for it indicates that the 
standard deviation of the mean of an array in a sample is not the same thing as 
the standard deviation of an array of constant size. It is not therefore legitimate 
to assume, as some authors have done, that the standard deviation of the mean of 
an array is given by the same formula, Le. of //7,, as for an array of constant 


size equal to the mean number in that array for a large number of samples. 


Had we included the terms of the next order in (vi) we should have obtained 
in the curled brackets terms of the order 1/7,?. For a small array this might 
be equally important with the term already given in 1/N. Hence some caution 
must be exercised in retaining that term and dropping terms in 1/7,?; at the 
same time for the larger arrays 7,/N may be commensurable with unity. 


We may throw (vi) into the form 


2 
Oo" 5 : 
Om, = SIRE Ea Beokae CEng oboe (vi)Pis 
a (1 = x) 42 
i N 
= orp for many purposes 
Ny 4 o) , J Pp ty 4 
This result agrees only with that given bv me in 1905*, Le. 
On, 
om, = —, 
Ny 


to a first approximation, i.e. when we may neglect 2/N as compared with unity 
and 2 as compared with the mean number in the array. To assume without proof 

* “On the General Theory of Skew Correlation,” p. 14. Drapers’ Company Research Memoirs, 
Cambridge University Press. 
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the above value to be true is not legitimate as there is no a priori reason why 
the standard deviation of the mean of an array of variable size should take, even to 
a first approximation, the value of the standard deviation of the mean of arrays 
of constant size, that constant size being the reduced frequency (i.e. 7, =v,N/M) 
of the sampled population. Actually a better approximation to o,,, appears to 
be obtained if we add two units to the mean frequency-—a correction which may 
be of considerable importance for small arrays. 

We next turn to the correlation of means and we have to evaluate the product 
of two expressions like (iii) for 6m, and dm, where p is not equal to p’. We shall 
obtain, summing for A samples and dividing by A, the value of om, om, Rm,m,y> Where 
Rm,m, 18 the correlation of the means. We shall again treat the square and cubic 
order terms separately: 

Square order terms in Om, Om2m,m, 

_ g {2 OraroXan) sa TS | ea 
| Niphy Z NipTy 
x Peete) My My LU (dN,dNy’) 


Nip Nyy Ny Np! A 


— MyS 


ere 
S (ign Nap’ LqXq') Ate 
=a i 


S (Mein La) 
Niyty 


?  Niipiiy 

S(Ng'p'NpLq) _ MpMy Ny Nyy 
Niipty nts VN 

My My | My My | MpMy — MyMy 


ND NGS aan: N 


SE 


= ()). 
Thus we see that as far as square order terms are concerned: 


Retcsttig =O. (case des yediaghene ts eee eee (vii), 
notwithstanding that there exists a correlation between the numbers in the two 
arrays on which the means are based. So far this result is only true to a first 
approximation *. But we now turn to the cubic terms given by the product of the 


square terms in 6m, with the linear terms in 8m,. There results the four separate 
terms: 


U{S (8Nqy'%q) S (82%qp5NyXq)} Ee MS {d (SN%qpONp ON’ Lq)} 


Ani? N yy Ai Ny’ 
ae My S {a (O%ay" (dn,)?) Ty} My My 2 {ony (3%y)7} 
An, Hy’ AR? Ny’ ; 


1 have evaluated each of these terms in succession by the use of formulae similar 
to those on p. 244, and all these terms give the same result, i.e. their sum with 
proper signs to its constituents 


— Bee (1 =) |) 2) MyMy (1 _ fig) | MyMy (5 Bip) _ 
Nii, N Nir, N Nii, N/' Ni, ( Nas 


* T have given this result in memoir just cited, p. 13. 
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Interchanging p and p’ we obtain precisely the same result for all the four terms 

in the product of the square terms in 6m,, with the linear terms in 6m,. Thus 

the cubic terms in the values of oy, om Rm m,, are also zero. We have thus 
p ™p pmMmyp 

established to a high order of approximation that (i) m, = mean of array in sampled 

population, (11) that there is no correlation between the means in any two different 

arrays, while (iii) to a lower order of approximation only o,, = o7,/V iy. 

Now we know that the distribution of means of samples of constant size taken 
from a population following any law of frequency approximates very rapidly as 
the size of the sample increases to the normal law. How far may we extend this 
result to the present case of the means of arrays the total frequency of which 
varies from sample to sample? 


With the view of considering the approach to a normal distribution, let us 
investigate the third moment coefficient of m, for the samples of the pth array. 
From (ii) keeping only lowest order terms we have 


dm, = S (8Xan%y) 
Np 
and accordingly 


POM pee Wi la 4 Ui(On, 5) ea LX (Sign)? ONg'p By? Lo 
mee Sane s t + 3S | i = 


m\ 
| 6S = SE) “ete 


ing (a), Chon) ps ¢ 
Lae 1 n 2n 
= lob IN 
. = 2Nap Malm esa re ) Nap Nq'yNq” ‘Dm mom 
= 38 Tay (1 ze H) neD FH AGiyt + 128 | 22"? Gy 


1 ao Oh ea aes 
- 23 Is (Tgn%y?) aN S (Ngpq") S (Tigp%q) + af (Nap Lg ad . 


The last two terms vanish with the factor S (7,,7,). Thus 


X{dm,)? 1 1 
asia =e iy? al43 > 


Ny 


where ,/43 1s the third moment coefficient of the array about its own mean in the 

population sampled. We have seen also that to the same degree of approximation 

x (dm,)? : 5 : 
a »)° ae where pi. = oa: Accordingly if ,B, be the value of the first 


B-coefficient for the distribution of the means of the pth array in samples: 


iS = na) / eta 


16—2 
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where ,f, is the first B-coefficient of the pth array in the sampled population. 
Hence whatever be the nature of the frequency distribution in that population, 
if the number in the array be not too small, the distribution of the means of arrays 
will approach symmetry, i.e. ,B, will be small*. 

The following values have been used in the previous investigations, some of 
which have and some of which have not, as far as I am aware, yet been publishedt. 
The remainder will be required in investigating ,B,, or the value of & (6m,)4/A. 


Third Order Mean Products for Random Samples in the case of an indefinitely large 
Sampled Population. 


X (8%qp)° ene Nap 2Nap 
° Pot $9)(-3) 
x CZ Nq'p} ; NapNa'p 2Rap 
Mu Saw ga eat 
(c) x (32%qp52q'p9Nq’n) a 2Tign Ng pNq"y 
A Ne ‘ 
(On ,7ONg) Ny 2» 
(d) mes ote (ea) Ca ai 
p> (82% p8NqpONq'p) 2Rap Na'p Ny 
, neat) Hhar(_) 
B {ir (8m) _ Dep (, _ Tip 
(f) r = Nap (1 N )( 7") 5) 
x (6n,)? _ Ny 2N»y 
(9) =x ell) tact 
2 
(h) x Bg eh : =i 
? X (8M p5NqdNy) Tgp Ny = 
( Se ee 
: x {(6m,)?5n,} 7x, Ths 
9) ae 
, \2 = oie 
(I) 3; 7 SNgp} a ~ \(a M,)? - o*, é 
X (6m, (ONqp)*} Nan /, & Top 
(1) Xr ear (%, My) (1 fae N ), 


X (5M pSNqpONq'p) Nap Ng’ 
pONap OM a'p ap "'q'p —- — 
= _— ZS Oba UL) Lat — mM . 
y iy N { a p + Xq pt 
* Even if the array diverged considerably from the Gaussian value, i.e. »8,= +2, say, then »B, for 

an array of even 10 only would be but -02 and the asymmetry very slight. 
+ (a) and (9), Phil. Trans., Vol. 186, A, p. 347 (Pearson, 1894); (6) and (c), Biometrika, Vol. 1x. p. 95 

(Soper, 1913); (d)-(m) probably here for the first time. 


Kart PEARSON 245 


The following fourth order mean products are deduced from the more general 
values given by Isserlis*: 


(n) x (8gn9N%q’p9Na’p9Na’"p) 49) (1 < 7) Nan Na'p Nq'p Ng" 
r\ 


x {(8Nqp)? 5Nq'p5Nq’n} ( 2 Nap Nq'yNg'p 3Ngp 
(0) r Sak I=) yo n’)> 


(P) E (Greg)*Bnyg} = — THB"? {3 (1 — 5) tap (1 —"*) +t, 
x {(8%qp)? (8Nq'p)"} =— = ( 2 ( Nap Na'n \ STiyy Nyy | I 
(q) ee) Tee 1—¥) C2 NL Nee aN ): Hp? 
while 
4 = ss 
(7) 2 (al — (1 -){s (1 - 5) hap ( -#)+ i} 


is due to Pearson ft. 


I shall now use (n) to (r) to determine & (6m,)4/A to the first order terms. 


B (im, a= Ff een 
A D 
= [8 (Ee) + ag (FP eee 
= r 


g € (8p) 28 Ng’) Ng" p Lg? Lg Lo 
A 


+ 12 


_ 6S (= (3%qp)” Cua) 


4 248 (Phen tea eae ee 
a \ . 


Substituting from (n)-(q) above we obtain the following result: 


) 
DOM) = ek i Ties re Hee re 
= aa a Is \Fu (le Sl area) x 


Nee Nes Neen o) 
9 ap gp Nap NQ""p pie eh i 
+ 24S | N2 (1 = Lg Ly" Le . 


* R. S. Proc. Vol. 92, A, pp. 28-29 (1915). 
t Phil. Trans. Vol. 186, A, p. 347 (1894). 
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O} 
W and those 


I now rearrange this into terms not involving the factor 1 — 


involving it. The former are: 


1 = Nap\ Na'p x ~ Na'p ~ 9m 
=| $ {Reo (1 — 32) at — 1s Taxte sno 3 8 ish + 68 | at 2a | 


1 Per eats epee es ane 
~ Fig! [S (ign Be* + 3 LS (Rgp%q2)}2/N — 4S (ign itg®) x -S (Mgr ey | 


ie | 32» 
= = 3 | ofa ar aa ofa? | > 
p 


since S (7igp%) = 0. Here ,pu, is the sth moment coefficient of the pth array about 


the mean. I now take the terms involving the factor 1 — 2 


N° They give 


+ (1-5) 39 Jn? (1 — “her 4 Fe) as — 129 {Maal (1 — Te) a tty 
lr o\l = yet N ny) Rate 


_— 128 Nap" a'p"a"'» l 3Tap mom 
2K By? Lg Lqr 
N Na jven bauta 


Rap Ng’ pNq'p Ng” ; 

A>) EC EGET IETD 3 RR GO 
+ 728 | N2 LL Lq'Xq | 
aF 


= ele H) |S ont 2S (Raph el Me q)}? a 


where the equivalence is most easily verified by expanding the latter expression 
and ticking off the corresponding terms in the previous one. 


Since S (7,»%,) = 0, we are left with the single term 


3 tie 2 
all w) pha” 


Thus we deduce combining our two sets of terms: 


2 (Smy)* _ ba , 3 (1 = x) fg" 
r Ti? | N/ 7,2 7 


X (8My)?)? pba? (pBo | 1 
a ESM eles Bh 


where ,,B, is the second f-coefficient of the distribution of m, for the pth array, and 
»bz is the second f-coefficient for the pth array itself in the sampled population. 


Now to the degree of approximation needful 


aioe at 2 (1 =) 
Ne ean ii, N/\° 


Accordingly 
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or DRi 3 Op (= = x) 


Thus we see that the condition for normality of the array mean, i.e. ,B, = 3, is 
by no means so nearly satisfied as is the condition for symmetry of the array mean, 
»B,=0. For, while the latter might be fairly closely approximated to by an array 
of 15, even if the array distribution were not normal, yet the former even in the 
case of normality and of a big sample would give ,B, = 4, a wide deviation from 
the normal. Thus the array must be fairly considerable for the distribution of 
its means to become reasonably Gaussian. As a rule the distribution will be 
reasonably symmetrical but is leptokurtic, i.e. ,5, > 3, and therefore is to be 
described by a curve of my Type VII or one of the form 


/ ie —s 
y=yo(1+ 5) 


rather than by a curve of the normal typet. Practically multiple frequency of 
this form is at present undiscussed and we are thrown back on treating the arrays 
as giving normal distributions of their means. Thus the assumption made by 
Slutsky in the paper cited p. 248 can onlv be considered as approximative, and 
that assumption was not legitimate until its degree of approximation had been 
investigated. It is singular that the goodness of fit theory can actually be applied 
with greater accuracy to test physical laws than to test regression lines. 


It is clearly only in the large arrays that the kurtosis (,B, — 3) approaches the 
normal value zero. What must be understood by ‘“‘large” can easily be estimated 


roughly. For 


pba 3. = det B nearly = 15/n, roughly. 
p 

Hence if 7, = 75, ,B, would equal 3-2, which is certainly a limit to what may 
be roughly treated as a normal distribution. Accordingly when we assume a 
normal distribution for the means of an array, we must remember that this is 
really very rough in the case of the smaller arrays, and that we only do it in default 
of a better theory. At the same time it must be noted that the small arrays will 
have less weight than the larger, and the error made in assuming their distribution 
normal will be of far less significance for the scme deviationt. Thus far we have 
shown that (i) the means of different arrays are uncorrelated, (ii) that the standard 


ate =r . Si eae 
deviations of these means are given by Gal V ti, where 7”, = in (1—) + oH 


* This agrees with the result given by Isserlis (Joc. cit. p. 31) only when 7%, = N, i.e. the “array” 
is as in his case a marginal total. 

{ For the curve to be of normal type s must be large. If the array were normal in the sampled 
population and the sample large, then s = (2x, + 25)/10, and this would be only 8-5, if m» were 30. 
Thus the application of Gaussian theory to samples with even minimum arrays of 30 can only be 
approximative. 

{ The deviations in the means of the small arrays are likely, however, to be much more irregular 
and greater than in the case of the large arrays. 
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and (ii) that the distribution of the means of arrays is for the larger arrays approxi- 
mately normal, but that for the smaller arrays the distribution is approximately 
symmetrical and markedly leptokurtic. The small arrays will, however, have 
small relative weight and this is the only and not very satisfactory reason for 
adopting a Gaussian system throughout. Slutsky * assumes the normality without 
I think adequate investigation, and for that reason I have been unable to consider 
his paper as adequate and final in this matter. 


With the assumption stated above we can use for the multiple regression 
surface 
sy ee ae 
Z = Rye oR 


») 
where Th ey Be (1 = x) ae 


Here m, is the observed value of the mean of the pth array, and m,, oj, and Ny 
are constants which if it were feasible ought to be obtained from the sampled 
population. iS denotes a summation for all values of p from the first to the last 
array. Now if we suppose m,, oj, and 7, known, we can calculate 


for, say, the series of ¢ arrays, and as there will be ¢ independent variables all we 
have to do is to determine from this value of x? the value of P, the probability, 
corresponding to it in the tables for “goodness of fit’ } under the value n’ =¢+ 1. 
This apphes to any form of frequency surface giving any type of regression line, 
1.e. locus of my. 


Thus far the problem looks straightforward, but now arises the difficult question 
as to what values are to be given to 7,, m, and oj, Which represent the unknown 
sampled population. Usually in problems, as of probable error, where we have 
the unknown constants of the sampled population we replace them by the corre- 
sponding constants of the sample. But it appears a somewhat arbitrary course 
to do this in the present instance (as suggested by Slutsky t) for 7, and Cn,» but 
not for m,. I do not think it accordingly legitimate to substitute for n, and 
a; the sample values and leave m, to be determined from other considerations. 
Clearly since our object is to test the goodness of fit of the regression line we have 
to replace m, by f (yp), where 

My = f (Yn) 
gives the regression curve or mean value of the array of 2’s corresponding to the 
value y, of the other variate y. Of course this regression curve is determined 
from the whole series of observations and not from an individual array. But 


* “On the Criterion of Goodness of Fit of the Regression Lines, ete.” Journal of the R. Statistical 
Society, Vol. LXxvit. p. 79. 

+ Tables for Statisticians, p. 26. 

t loc. cit. pp. 78-84. 
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m, is nevertheless subject to the probable error of the random sampling. I have 
shown that the probable error of m,. if the regression be linear, is 
S14400.VT= Fay 4 Wo Oty 
VN : 
But this of course is not the probable error of m,, but of m, as found from the 
regression line, and will generally be small as compared with the probable erroz 
of m, found from a single array f. 


Oy 


Just as we have found m,, however, from the whole system of observations, 
so it appears to me we ought to determine o?;, and 7, from the whole system and 
not from a single array. The centre of oj, 1s m, and not m, and to calculate a 
value for oj, with m, as centre seems to be an erroneous step especially when, 
owing possibly to n, differing considerably from 7,, m, is much displaced from 
m,. I hold that for satisfactory results it is just as needful to find x, and oj, 
from the whole range of data—not from the individual array—-as it is to find m,. 
This means that we must have some knowledge of the form of the frequency 
surface, and until we have this we cannot apply the test for goodness of fit to 
the regression line. There are accordingly two separate factors remaining for 
solution after we have determined the regression line: 


(i) We need to determine 7,. This is the total frequency of the pth array 
of xs. It can clearly be determined, when we know the frequency of the y’s 
in this group. That is to say, we must determine a theoretical distribution for 
the marginal distribution of the y-variate. In some cases it will be sufficient 
to assume it Gaussian and then the table of the probability integral will suffice. 
In other cases it will be advisable to determine a skew frequency curve. But 
as a rule it will certainly be needful to graduate the array frequencies by some 
process, and not to assume them given by the observed marginal frequencies f. 
The bringing of the determination of 7, into line with that of m, does not seem 
therefore to present great difficulties. 

(i) We need to determine oj. If the frequency surface be homoscedastic, 
then o®; =, (1—7,.,), if m., be the correlation ratio of x on y, and the 
regression be skew. But if the regression be both homoscedastic and linear, then 
ori, = 6,7 (1 — 7*,,,), where 7,, 1s the correlation coefficient. In these two cases 


we may write respectively 


2 aa fil"y (my — Fi), 


Cae (1 es Tak 


* Biometrika, Vol. 1x. p. 10, with the necessary changes in notation to fit the notation of this paper. 

{ Extreme arrays here again form an exception. 

{ A precisely similar difficulty arises in working the ordinary expression for mean square contin- 
nng 


gency, ie. 1+ ¢2=S8 (; 


Np. Nig 


Ie where 7,. and 7.7 are the marginal frequencies (reduced of course in 


the proportion of size of sample to population) in the sampled population, not in the sample, although 
we ultimately use their sample values. There is more justification, however, in this use, for contingency 
is usually applied to broad categories and in such cases we have, perhaps, 3 to 7 marginal groups only; 
there is thus relatively less fear of big irregularities in 7). or 7.q such as arise with the small arrays of 
regression lines. 


Price of rye in Samara 
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and v= 2 (1 a 7.) S {n"", (Mm, — M,)*, 

which admit of easy calculation when once we have determined that characters 
of the sampled population m,, 7,, G, Fry OT Ar. may be replaced by the like 
characters calculated not from a single array but from the whole material of the 
sample. I am inclined to believe that 


— y Meu)” = ru)? 


Le 7 Ley 
Riera ae 
would be an effective measure of y?, where, however, we must not put 7, 


y= Ney 
in the numerator for we are actually measuring the improbability of the deviations 
of ny. from H,,,, 1.e. of m, from m,. But such a form would be of little use unless 
we knew the sampled population. It indicates, however, how risky is the problem 
of replacing only certain of the sampled population values of the constants by 
those of the sample. 

In any given case it appears best to draw the scedastic curve, i.e. plot Fn, to 
yp» This curve may be fitted with either a straight line or parabola of the second 
order, according as the array variability decreases or increases in one or both 
directions from some fairly central array. This is perfectly easy, but care must 
be taken to weight the standard deviations of the arrays with their frequencies. 
Very rarely as I have pointed out previously are the data ample enough to justify 
anything but a linear scedastic curve. If this line be practically horizontal then 
we can take 07; =o," (1 —7*,.,) throughout. I propose to illustrate the whole 
process on the two examples selected by Slutsky. 


Iilustration I. Prices of Rye in Samara. 
The example is given in Slutsky’s recent paper* and is as follows: 
Correlation between prices of rye at monthly intervals. 


Price of rye in Samara a month earlier 


| _Copecks 


Totals 
per pud 


25) | 300 <26."| 440") 25en |e 50) leooen|) sco eComml ee Omnia 


= 
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ewe 


= OO 


* “On the Criterion of Goodness of Fit of the Regression Lines and on the Best Method of Fitting 
them to the Data.” Journal of the R. Statistical Society, Vol. Lxxvit. p. 81. 
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The data are not very suitable, but I use them because Slutsky has done so. 
Their unsuitability arises (i) from the smallness of the total population, but 
especially (11) from the marked signs of heterogeneity obvious in the marginal 
totals. If y be the vertical, 2 the horizontal variate I obtain, assuming that the 
column headings represent central values, the following constants for the distribution : 

M, = 47-1572 copecks per pud, o, == 13°7769, 

m, = 47-0363 copecks per pud, o, = 13-6852, 

sy = 900,04), 

Regression line: y, = :947,205x -+ 2-3687. 

My values of o,,0,, %»,, and of the regression line constants, differ very considerably 
from Slutsky’s. I have not used Sheppard’s correction which would, however, 
only have emphasised our differences. ! have not reworked Slutsky’s first method 
with these changes because I do not think it is the correct method, i.e. he uses 
observational values for 07, and thus I cannot say whether the above values 


would improve his bad fit (i.e. P = 0-02), but I have adopted my own value 


0°, / (Ti (1 ios - se 2), 


using for o°;, the mean value o,* (1 — 7°,,) = 16-9965. Slutsky takes for the 
value of o;,,, on the assumption of homoscedastic arrays, the mean of the observed 


standard deviations of the arrays, i.e. 
ees S = 3-8022 
on = Wy! (Np Op,) 22 


according to his values, or he takes 
7; = 14-4567. 
This value is, I think, theoretically incorrect, the mean value of o°,,, =, (1—1,y) 
and this must be the homoscedastic value. Clearly Slutsky’s value is too small. 
The point remaining is the value of 7,. I should naturally determine it from 
the frequency curve for the marginal x-totals, but the extreme irregularity of the 
marginal x-totals—due partly to paucity of data, but more to probable hetero- 
geneity—makes any such process unsatisfactory. I have therefore taken 7, = to 
the observed array frequency—a result with which I am thoroughly dissatisfied, 
but which appears to be the only course. We have then the table on p. 253. 
IAG. 
Thus = ee = 15-11, 
Looking this out in the Tables for Goodness of Fit we find for n’ = 11 + 1 = 12, 
12) c= ol lisy 
Thus we see that the fit is passable, although not brilliant, much better than 
Slutsky’s P = -02. 
Slutsky also gives by his second method y? = 15-1 and P = -18 for the fit on 
the hypothesis of homoscedasticity, but I think this can only arise from a curious 
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xarray | 2p= Np @ = =) +2 | Observed m, |Theoretical mp | (mp — mp)? | n”, (mp — Mp)? 
26 10-8548 28-333 26-049 5:2167 56-6262 
30 22-6613 29-523 30:°785 1-5926 36-0904 
35 9-8710 34:375 35-521 1-3133 12-9636 
40 5-9355 42-500 40-257 5-0310 29-8615 
45 | 16-7581 44-000 44-993 “9860 16-5235 
50 | 26-5967 50-800 49-729 1-1470 30-5064 
55 | 10-8548 55-500 54-465 1-0712 11-6277 
60 | 14-2984 59-600 59-201 -1592 2-27631 
65 14-2984 62-200 63-937 3-0172 43-1411 
70 7-:9032 70-000 68-673 1-7609 13-9167 
vo | 3:9677 72-500 73-409 8263 3°2785 


S{n, (Mp — Mp)?} = 256-81196. 


balancing of errors. For (i) his theoretical m,’s differ considerably from mine 
owing to the divergence in our values of standard deviations, correlation and 
regression, and (11) he has used a different value to mine for ,, and what I believe 
to be an erroneous value of o?;, —namely too low a value. It appears to be thus 


a mere chance that we should reach the same result. 

Illustration II, Auricular Height of School Girls. 

Slutsky takes this example from my memoir on Skew Correlation*, and it is 
a peculiarly good illustration for the following reasons: 

(i) The regression line is distinctly skew. In my paper Y,,, the mean deviation 
in auricular height from the mean auricular height of the general population of 


girls who differ from the mean age of the general population by X,, is given 
by the cubic 


Y,, = -296,076 + -722,886 x X, — -029,580.X,,2 — -002,223X,,3, 
and the goodness of fit of this regression line is to be tested. 


(il) The frequency distribution of the ages of the 2272 girls is: 


cm hs ehet at) G7 729.1 89 | 670. | 1017 | 1112 | 10-73 | 13—14 | 

Frequency 1 7 18 40 | 76 | 152 177 235 261 | 309 | 263 | 
Age... | 14—15 | 15—16 | 16—17 | 17—18 | 18—19 | 19—20 | 20—-21 | 21-22 | 22-23 
Frequency | 198 | 214 162 95 61 13 7 | 8 2 | 


This is not normal. We find for its constants: 
Mean age = 12-7007 years, 8, = -001,335, 
o, = 3-064,819 years, By = 2-710,593. 
The probable error of 8, is about -045 and thus £, cannot be the result of sampling 
from Gaussian material. £, is sufficiently near zero to mark the distribution as 
substantially symmetrical. 


* Drapers’ Company Research Memoirs. Biometric Series 11. p. 34. Cambridge University Press. 
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The distribution can accordingly be described by a symmetrical limited range 
curve of my Type II: 
g2\m 
Ue yo(1 aa =) . 


The values of the constants being found we have 


} 4 ‘De 2) 7°866,024 
y = 283-7415 {1 = iurRe , 


This curve was drawn on a large scale and the frequencies for each age 
integrated with the following results: 


Frequency , Frequency 


Age ; Age aren 
Observed Calculated Observed | Calculated 

3—4 ] 1-5 13—14 263 275 

4—5 a 7:5 14—15 198 246 

5—6 18 19 15—16 214 197 

6—7 40 41:5 16—17 162 143°5 

7—8 76 76 17—18 95 91-5 

8—9 125 123-5 18—I19 61 53 

9—10 177 178 19—20 13 26 
10—I1 235 227°5 20—21 7 9 | 
1J1—12 261 265:5 21—22 8 7 | 
12—13 309 | 282°5 22—23 2 1:5 

| 


The goodness of fit of the calculated to the observed array frequencies was tested. 
I found x? = 25-4769 giving P = -146, or such a sample would occur about once 
in seven trials. The fit therefore is a fairly reasonable one, and the above values, 
not the observed ones, have been used for the array frequencies. Clearly they 
effectively smooth the random sampling. 

(ii) The standard deviations of the arrays have been given by me*, and it 
has been shown that the arrays are very far from homoscedastic. The value 
of 7 1s 303,024, which combined with o, gives for the mean square standard 
deviation of the arrays in squared 2 mms. units 

Gp? = 10-835,433. 

I now somewhat diverged from the plan of my memoir on skew regression. 
I sought the best fitting straight line to the weighted squares of the standard 
deviations, i.e. I made 

u = 8 {n, (¢,2 — Ax — B)} 
a minimum, where », is the frequency of the pth array of auricular heights for 
girls of age w, and a, is the standard deviation of this array. In working this 
I omitted the first and last arrays as quite unreliable. I found A = -436,706, 
giving for the line 

On? — Ty? = °436,706 (x — &), 
or op? = °436,706x + 5-290,970. 


* Loc. cit. p. 34, Table and Plate I, Diagram II. 
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From this equation the squared standard deviations of the arrays were calculated 
with the following results, the unit being 2 mms. : 


Squared Standard Deviation | Squared Standard Deviation 
Age array 5 pe Sale Age array {aa a1] re | 
Observed —Caleulated Observed Calculated 
3—4 [07] 6-8194 13—14 11-2822 11-1865 
4—5 8-3250 7:2561 14—15 | 12-8630 11-6232 
I—6 8-5708 76929 15—16 12-0152 12-0599 
6—7 8-7859 | 8-1296 16—17 14-9738 12-4966 
7—S& 8-9293 | 8-5663 17—18 10-0356 12-9333 
8—I | 6-9517 9-0030 18—19 9-7563 | 13-3700 
I—10 11-4765 9-4397 19—20 23-4314 | 13-8067 
10—I1 8-7930 | 9-8764 20—21 6-4065 14-2424 
L1—12 10-2970 10-3131 21—22 17-1512 14-6801 
12—13 10-2791 10-7498 22—23 [9166 ?] 15-1169 | 


The calculated results give a reasonable graduation of the somewhat erratic observed 
values, and these values were used in evaluating the formula 
n= gee = 
x S | on 5 

The following table now gives all the required data for the calculation of y?, 
and we deduce x? = 9-5845 corresponding to P = -974 for n’ = 20+ 1=2]1 as 
argument. 

Regression Curve for Girls’ Auricular Height with Age. 


| | a ae 
Age group, 7, y= ( 1- x) +2 Mp* my, * | (mp — nip)? * | ont ut Soa 

34 | 1-5 3-4987 115-25 | 116-90 | 2-7225 | 68194 1:3968 
4-5 75 9-4934 116-96 | 117-66 49007-2561 “6411 
BG 1 a 20-9833 117-47 | 118-42 9025 7-6929 2-4617 
Guyer 241-5 43-4635 119-10 | 119-24 0196 8-1296 -1048 
(=e *76 77-9331 120-30 | 120-03 0484 8-5663 4403 
8—9 | 123-5 125-3913 121-63 | 120-93 -4900 90030 68246 
ome: 3-178 179-8433 121-72 | 121-78 00369-4397 0686 
F011 | (227-5 229-2997 122-82 | 122-62 0400 9-8764 9287 
11—12 | 265-5 267-2663 123-14 | 123-42 0784 | 10-3131 2-0318 
12-13 | 282-5 284-2513 123-89 | 124-18 0841 -10-7498 22238 
1274 | 275 276-7579 124-86 | 124-88 0004 11-1865 0099 
15 | 946 247-7834 125-71 | 125-52 0361 11-6232 7696 
W516 || 197 198-8266 126-16 | 126-07 0081 12-0599 -1335 
16—17 | 143-5 145-3737 126-53 | 126-52 0001 12-4966 0012 
W718 |) 91-5 93-4195 126-91 | 126-87 -OOLG 12-9333 -O116 
WS 198\. 53 54-9533 127-02 | 127-09 0049 13-3700 0201 
19—20 | 26 27-9771 129-56 | 127-18 | 5-6644 |: 13-8067 11-4780 
pyre 9 10-9921 123-82 | 127-11 | 10-8241 14-2434 83533 
2 Wee 8-9938 126-50 | 126-88 1444 14-6801 0885 
Dp | 1 3-4987 125-25 | 126-48 | 1-5129 | 15-1169 3502 
| 2272 = | 22 = ae | Total y’? = 38-3381 


True x? = 4 = 9:5845 
* In millimetre units. For values of m, and m, (cubic (c)) see loc. cit. p. 37. 
+ In 2 millimetre units. Hence x? as given above must be divided by four, to obtain true value. 
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We are now in a position to appreciate the influence of the various factors in 
the solution for this case. I hold that as m, must be given the theoretical or 
calculated value, so also must the standard deviations of the arrays, 1.e. 0,7. But 
as we do not know the sampled population in this case we must obtain o,” in 
precisely the same manner as we find m,, 1.e. not by taking a value obtained from 
the individual array*, but by values graduated from the whole sample. Further 
it does not appear correct to take o,*= 0°; /7,. The latter is only true when the 
number in the array is a priori fixed, that is to say is not itself provided by the 
random sampling. In the latter case we must take o,? = 07; /n'’>, where 

pee (1 as x) +2, 
to a second approximation and this modifies considerably the standard deviations 
of the smal] frequency arrays. If in the above example we use 7,, the theoretical 
frequency of the array, instead of the value 7’’,, we find, still using the theoretical 
a5, X° = 86166 giving P= -986. Slutsky*, who has used both the observed 
frequencies and the observed standard deviationst of the individual arrays, 
finds x? = 9-17 and P= -980. It will be seen. that the correction for 7’, is 
the more important factor. In this case no marked changes are produced 
by using observed quantities instead of graduated values, but I think that in 
short series very fallacious results might be reached by this process, and the present 
paper is written to suggest caution at this point. 7, and o%;, are as definitely 


at m, values in the sampled population, not in the sample itself. 


Application of “Goodness of Fit” Theory to testing Physical, Technical or 
Astronomical Measurements. 


In these cases there is no question in the ordinary sense of a frequency surface. 
The physicist makes a few measurements of a variate A for each of a series of values 
of a variate B. He plots the mean of his measurements for A to each value of his 
variate B, and he enquires whether the curve given by his series of mean values for 
A is closely approximate to some theoretical curve. It will be seen that his problem 
is very similar to that of the statistician. He has a number of means for the 4 
variate, m1, M,... mM, ..., and he considers whether they are good fits to a theoretical 
curve—the statistician’s regression curve. Obviously in this case these means are 
non-correlated as approximately in the statistical case. Further the variability of 
a mean will be given definitely by o,,/V/7,—not now as an approximation. Here 


7, is the number of observations in the array on which m, depends while oj, is 


* Loc. cit. p. 81. 

+ In the case of the 3-4 array of one with observed o, = 0 Slutsky says this is due to random 
sampling and extrapolates a standard deviation; this is of course only a first slight step towards the 
proper graduation of the whole system of array variations. In the case of the array of two for 
23-24, the observed value is c, = 1-9148, which is just as much an inconsistency due to random 
sampling, but this value although about } of the real value is retained and used by him. 
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the standard deviation (mean square error) of an indefinitely great number of 
measurements which he might make of A for the given value of B. He attributes 
these variations in A to “errors of observation,” and he usually supposes such 
errors to obey the Gaussian law. This attribution is somewhat dogmatic. There 
is very little definite proof that errors of observation actually do obey the Gaussian 
law and secondly his “errors” are not in all probability solely due to observation. 
It is impossible to repeat each experimental measurement under precisely the same 
physical conditions for other variates C, D, H, ... and changes in these variates may 
be as influential as personal errors of observation. Further it is far from certain that 
the value of B has remained without some variation and this alone would tend to 
cause some variation in A. The physicist aims at a constant value of B—it is by 
no means certain that he always reaches it. Without much more investigation 
than is easy, or is at all likely to be made, we probably can at present do no better 
than assume the distribution of A’s for constant B to be Gaussian. Thus the distri- 
- bution of means determined by the physicist will have for its distribution surface 


—1y,2 
z= Me Lae 
; Ny (My — M,)* 
where v= S| ar ae 
On, 


and the corresponding value of the probability to be taken from the “ goodness 
of fit’ tables will be found by taking out P for the given value of x? under the argu- 
ment n’ + 1, where n’ is the number of arrays, for there are now n’ independent 
variables, i.e. m,’s. 


In the above value for x?, m, is the theoretical value of A corresponding to 
the given B, m, is the observed value and n, is the number of observations on which 
it depends. The real difficulty arises in determining om, Which is the standard 
deviation of the array for an indefinitely large number of observations and cannot 
be determined properly from the few observations made to determine the A 
corresponding to a given B. 


If there are a considerable number of observations in an array and Gp, be the 
standard deviation of the array found from the observations themselves, then 
it is well known that the “best” or most probable value of oy, 1s given by 


Seen 
| Oy 
S \ a) es 


o2 
Np 


2 
Ca — 


In this case yo 


a form which shows us that if we have arrays with only a single individual and 
use the observed Fn, 8, X° will be indeterminate, for n, — 1=0 and on, = 9. But 
the observed G», would be very risky for any system of small arrays, and this 
method of approaching the difficulty must I think be dropped unless the physicist 
be inclined to increase very much the number of measurements he makes of A for 
a given value of B. But the above method of approaching the subject indicates 
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how a single value of A for each value of B cannot lead to any measure whatever 
of the “goodness of fit.”” I suggest the following method of determining a suitable 
value for oj,. Let us assume that the arrays are homoscedastic, i.e. that the 
physicist’s difficulty in measuring A is the same for all values of B. This will not 
always be true, but is a fair working hypothesis for many cases. Then we have 

on, Opa Uh ats) 
where oy is the standard deviation of all the measurements made on A (without 
regard to the value of B), a much more reliable quantity than the standard 
deviation of any array values of A. Further 7,., is the correlation ratio of A 
on B or 

Pag = Sita (thy = 
No? 

where m is the mean value of all measurements of A and N is their total number. 
Thus 

2 S {Np (My cs Mry)”} 

og (I eee) 


can be readily determined as soon as m, o, and 7 4.g have been found. 


Of course it is needful for a test of this kind that the number of measurements 
of A should considerably exceed the number of values of B tested. It would fail 
entirely if only one value of A were taken for each value of B, however numerous 
the latter might be. We must have some basis on which to determine the error 
made in single determinations of A. This is a point I think often overlooked by 
the physicist. A fairly good determination—I mean a quantitative determination— 
of the goodness of fit of theory to observation could be made from 10 series of 
8 observations of A corresponding to 10 values of B. But no measure of “goodness 
of fit” could be found from 80 observations of A corresponding to 80 values of B, 
and yet the latter system would probably make the greater appeal to most physicists. 
I do not see how quantitatively to obtain any measure of the goodness of fit of 
theory to observation in the latter method of procedure. It is not unusual to 
determine the mean square residual, 1.e. 

S (Mp — My)*/N, 

but before we can really make use of this, i.e. find y? and so P, the probability of 
as great or greater divergence between theory and observation, we must know 
o°;,, and this can in no way be deduced from such a system of observations. In 
fact without a knowledge of o?;,—the unit in which (m, — m,)? is to be measured— 
the mean square residual is as delusive as the ocular comparison of a graph of 
the theoretical and observed results, where all turns on the arbitrary scale of the 
vertical ordinate. 

Illustrations. As illustrations I will take some of the data connecting length 
of are with loss of carbon per coulomb provided in a recent memoir by Professor 
Duffield*. My only reason for taking this material is that it is recent work and 

* R. S. Proc. Vol. 92, A, p. 125. 
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that I was as a mere statistician struck by the appearance of the curves as I turned 
over the pages of the Proceedings. I have considered the cases of 2 and 4 ampéres 
for the anode, Diagram I. The data for individual experiments are given on 
Prof. Duffield’s diagrams, but his tables only give mean values. I have therefore 
had to measure off individual values from his diagrams. The means obtained 
from these measurements agree in the main with the author’s recorded means, 
but in a few cases it would appear that there is some discordance between table 
and diagram. On the diagrams curves are drawn presumably representing in 
each case the smoothed relation which the author considers to hold between loss 
of weight and length of arc. My object is to enquire how far these curves are 
probable representations of the phenomena in question. It will be seen that the 
observations are usually made in pairs, more rarely in triplicate or are even 
quadruple. There are a considerable number of isolated observations. We have 
thus rather slender material from which to determine the error of a single observa- 
tion, and clearly but little conclusion could be drawn did we not assume that a 
measurement at each are length was of equal weight*. 


The following table exhibits the work—first for the 4-ampére curve; the 
weight lost being per coulomb x 10° grammes. 


ioht = 23 a 
feeth Np oa, My Mp Wp — W My — W My — My | S {Np (mp — Mp)*} 

1 1 17-62 | 17-620 | 17-52 — 3-32 2 ey) + +10 0100 

2) 17-92 — 3-02 

ae ea 18-110 | 18-24 = 5.64} ~ 2-83 ~ 13 0338 

. = “75 
2 } 2 Joga} | 19405 | 19-15 5 1aef — 1-54 + +25 1250 
19-70 | ae _ 1-24) os ee 

ie \ 2 Boa 20-010 | 20-12 o ae5 ~ 93 Sait 0242 
5 1 21-43 | 21-430 | 21-29 + +49 + -49 eld 0196 

7 1 21:35 | 21-350 | 22-26 AD fe aon 8281 

8 22-40 a + 1-46) 

e \ 2 Be 22-515 | 22-51 Pet + 1:57 00 0000 
10) BIT | oo. ES + 43 Sie a 

Beh 2 Sel 22-130 | 22-71 ate + 1-19 — -58 6728 
2 22-55) A + 1-61) 

: \ 2 See) 22slon |. 22779 nt + 1-87 + 02 0008 
15 1 22-69 | 22-690 | 22-79 + 1-75 + 1-75 210 0100 
20 1 22-89 | 22-890 | 29-79 + 1-95 1-95 + 10 0100 

: : Sum of squares | S{n,,(m,-— w)*} 1-7343 
TORE 0 Ti sos = 57-2166 ‘= BB-B422 — | = 840, (1p — Fp)*} 
4 aioe eet 2(1—7? roe 
G=20-94 & we o.2= 33657 | 3¢=3:2672 [721] 2217-607 
ae lies 12, “f. P = -0920 pe == =.9709 
O44" 


* This is probably not true, but there is no evidence of multiple measurements at points where the 
measurements might be assumed to be critical, 


W==9 
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Thus the probability, if the inscribed curve represented the phenomenon, that 
a system of observations as unfavourable or more unfavourable than the observed 
would arise is about 1 in 11, i.e. in eleven trials we should have had one result as 
bad. This is not very great odds (10 to 1) against the inscribed curve describing 
the facts, but it cannot be called a highly satisfactory concordance. 


I now take the anode 2 ampére’s data. The work is given in the following 
table: 


Are | Weight lost, no le = S ewe ae 
length | % Wp Mp Mp Wp — W My — W My — Mp Np (Mp — Mp) 
1 25-31) Pie — 4-80 
< } 2] d6-04f | 25°68 | 25-68 eee ~ 4-43 + -02 -0008 
2 26-04 ~ 4-07 
': } 3 207 26-59 | 26-46| — 3-34 — 3-52 + 13 0507 
i 26-95 IG 
3 Al" 26:93) cA er oulenegs _ 3-28 ; 
: \ 2| Sesoy [2787 |2768| = reat ~ 2-54 = am 0242 
4) 28-93 ae ~ 1-18 
Sloe Sul 29:04 | 30°73 E 36} ~ 1-07 ~ 1:69 57122 
6 34-00 + 389 
3 34:28 | | , +417 
i An aia 34:57)|3446 7 oe + 4-46 i 0484 
sf 35-11 + 5:00 
8 33-98 + 3:87 
3 } Be aie 11 94-40) a G1 ame ney + 4-38 = a0) 0288 
1st KTR ea ___ | Sum of squares | S {n, (m,- w)*} 5-8651 
Motels a ee = 212-92 = '209:51 — | = 84% (a0) - mp)?} 
ae 5 ; D(iloae 
B=3011 | — | — |o,?= 14-1947 | 32 = 13-9673 [74 Oo)| ya = 25-72 
: ' >? 
Ve 16 ee 0003 1? = —, =°9839 
C4 


Thus only three times in 10,000 trials if the inscribed curve actually represented 
the phenomenon would a series of observations so widely divergent as those 
observed arise. In other words the inscribed curve must be definitely rejected 
as a probable description of the series of phenomena. Now the great advantage 
of this ‘‘ goodness of fit’ method is that by the very working of it out the actual 
regions at which the theoretical results diverge with a maximum of improbability 
from the observations are indicated, and the investigator is able to say here are the 
points where discordance is greatest and where theory or observation needs modi- 
fication. Clearly in this case the whole burden of the discordance falls on the 
observations at arc length 4. The special examples selected are of no real import- 
ance; the author probably laid no stress on his inscribed curves, and a little better 
draughtsmanship might have bettered them to some extent. They are used simply 
to illustrate that a new instrument is ready to the hand of the physicist. He must 
have felt very desirous at times in the past of being certain how far his observations 
were in accord with theory. How many times must he not have put to himself the 
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question: What is the probability that my theory describes adequately my facts? 
Meanwhile the statistician who examines the physicist’s diagrams has often been 
forced to smile by the degree of discordance which the physicist has allowed to 
pass as if some graph could demonstrate adequately the harmony of physical law 
and experiment. I look forward to the time when no physical paper will be 
considered complete unless it provides at the end of each series of experiments the 
value of P, i.e. the measure of the goodness of fit of observations recorded to 
theory adopted. There will be no excuse, there really is no excuse, now that 
tables are provided, for its omission. It is always possible in the course of an 
hour or so’s arithmetic to measure the accordance between supposed law and 
recorded observation. 


One of the important steps in the work given is undoubtedly the measurement 
by means of the correlation ratio of the mean square error of the physicist’s 
determinations. I think that is undoubtedly the best way of finding it. The 
physicist is apt to use “mean errors” instead of mean square errors. He does not 
recognise that the probable error of a mean error is sensibly greater than that 
of a mean square error. But he may wish in the present case to have some test 
of the accuracy of the determination of o,, the standard deviation of an array, 
from other processes. One which it appears to me ought to be satisfactory is the 
following: Let the observations of A for a given value of B be taken in pairs, 
and let all these pairs of A values be formed and their differences x,— a, be taken 
in each case, x, being greater than x, then the following relation should hold if 
the distribution of errors be Gaussian : 

_ Mean (x, — 2)? — (Mean (x, — 2))? 
-72676 : 
For Professor Duffield’s 4-ampére curve this gives 
o,° = 2444 against o,?(1 —7*) = ‘0977, 
and for his 2-ampére curve 
G42 = -1987 against o4? (1 — 47) = -2274. 

The latter, considering that we deal only with 13 pairs, is fairly accordant; the 
agreement in the case of the former is very poor, but in this case there are only 
six pairs. I have purposely introduced this case, because no real verification of 
the 7 method could be obtained on the basis of six pairs only, and yet for the 
4-ampére curve the whole question of whether the graph is a reasonable description 
of the data actually depends on the existence of six paired observations in the 
total of 17. We are really left without adequate material to determine effectively 
the probablé error of any observation. This may be of no importance in the 
present case, but the absence of adequate repetition of the value of A for a given 
value of B in order to determine the probable error of observation and so the 
“goodness of fit”’ of observation to theory is characteristic of much current physical 
research. 

Iam much indebted to Mr Andrew W. Young for algebraical and arithmetical 
aid. 


Gee 


ON THE *BEST’ VALUES OF THE CONSTANTS 
IN FREQUENCY DISTRIBUTIONS. 


By KIRSTINE SMITH. 


(1) If we attempt to fit the normal or Gaussian curve to a system of observa- 
tions, we almost invariably determine the constants @ and o of the equation 
1 (vw — 2)? 
y= es e fs 5 o 
V 20 o 
by the method of moments. This method of moments has been extended by Thiele, 
Pearson, Lipps and others to obtain the constants involved in various skew 
frequency curves and series. It is an undoubtedly utile and accurate method; 
but the question of whether it gives the ‘best’ values of the constants has not been 
very fully studied. It is perfectly true that if we deal with individual observations 
then the method of moments gives, with a somewhat arbitrary definition of what 
is to be a maximum, the ‘best’ values for o and % in the above equation to the 
Gaussian. Pearson* has shown that the method of moments agrees with the 
method of least squares in the case where the distribution is given by a high 
order parabola, and accordingly the method of moments is likely to give a very 
good result, when an expansion by Maclaurin’s Theorem would closely give a 
frequency function. But the method of least squares itself can now-a-days hardly 
be spoken of as more than a utile and accurate method of fit, indeed its utility, 
owing to the cumbersome nature of the equations which frequently arise, is often 
far less than that of the method of moments. 


Gauss’ original proof that the probability of the observed individual results 
was a maximum when & and o have been determined by moments has led to the 
extension of the conception that for grouped data, and for other results than the 
Gaussian curve, the ‘best’ values of the constants must be given by the lowest 
possible moments. This is of course not true For example, if we had as fre- 
quency curve 


Y = Yoe , 
and used individual observations, then the Gaussian ‘best’ value for % would be 
that found by determining the point for which the third moment coefficient 


* Biometrika, Vol. 1. pp. 267-70. 


abate fe bags 


KirstinE SMITH 263 


vanished, and the ‘best’ value of o would be determined by o = \/py4, where py, is 
to be taken about the point for which p, = 0*. 


From another standpoint, however, the ‘best values’ of the frequency constants 


may be said to be those for which 
S (ns a Te 


Pj EE 
x i, 


is a minimum, where n, is the observed frequency and 7, the theoretical frequency 
of the sth groupt. For when x? is a minimum then P, the probability of occurrence 
of a result as divergent as or more divergent than the observed, will be a maximum, 
or the frequency constants will have been so chosen as to make the probability 
P of results, as divergent from theory as the observed data occurring, a maximum. 


It sounds somewhat paradoxical, but it is none the less true to say that the 
“best value’ of the mean is not necessarily the mean value, nor the ‘best value’ of 
the mean square deviation necessarily the mean square deviation{. I shall illus- 
trate this in the following five cases: 

I. Fit of a normal curve to unilateral data. 

II. Fit of a normal curve to bilateral data. 

III. Fit of a Poisson limit to the binomial. 

IV. Fit of a binomial to binomial data. 

V. Fit of regression lines. 


The general method is as follows. Suppose f to be any independent frequency 
constant; then y? is to bea maximum with the variation of f. Accordingly we have 
from 

1 a x? = 3 (25) 
Ns 


* University of London, Honours B.Se., Papers in Statistics, Thursday, Oct. 28, 1915. 

+ Phil. Mag. Vol. u. p. 157, 1900. 

There is a point of some philosophical interest here which deserves further consideration. As is 
well known the Gaussian demonstration depends on making the product 


P | E —¢@ 2 0 , 
‘a 210 J 

s being taken so as to include each individual observation, a maximum by varying o and &, the result 
being that the ‘best’ values are found from the first two moments. Now it will be observed that this 
is not the same idea as lies in the x? test of goodness of fit. The conception of ‘goodness’ in that case 
is that we should measure the probability of a drawing from a certain population giving as divergent 
or a more divergent result than that observed. In other words while the Gaussian test makes a single 
ordinate of a generalised frequency surface a maximum, the x? test makes a real probability, namely 
the whole volume lying outside a certain contour surface defined by x? amaximum. Logically this seems 
the more reasonable, for the above product used in the Gaussian proof is not a probability at all. To 
make it a probability it must be multiplied by the product {éx,}, and then the probability of the actually 
observed result, namely x1, wp, ... @, ... Yq, Will of course be infinitely small, and what is made a maximum 
is an infinitely small probability. The exact meaning of P {6z,} when a, is an actual observation is 
obscure, but it appears that the probability for constant indefinitely small ranges of the variates in the 
neighbourhood of the observed values is made a maximum. But probability means the frequency of 
recurrence in a repeated series of trials and this probability is in the case supposed indefinitely small. 
It seems far more reasonable to make a finite probability, i.e. the probability of a divergence as great or 
greater than the observed a maximum, i.e. to use the x? test and not the Gaussian principle. 
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a number of equations of type 


iS (f a) OF Ail (1). 


These equations will generally be far too involved to be directly solved. Accord- 
ingly we proceed thus: We suppose that the values of the frequency constants 
given by the method of moments are good starting-points, and we put, if f denote 
the moment value of a frequency constant, f= f+ Af. Accordingly if there be 
a number fi, fy, ...f, of independent frequency constants, we shall have a series 
of equations to find Af,, Af,, ... Af, of the type 


o=sfee]}+ 968 Se] —2 ge} 


+f) -aL Dh 


oO (F4] EG 00, 


where a square bracket round the differential coefficients signifies that the frequency 
constants f,, fy .-.,f, therein are to be given their moment values f,, fy, ...f,. These 
values are of course also to be used in 7,. 

Since S(7,) = N, it is clear that 


dn dn di, 
S Ga =S (FG) S Gaz —ACtCe—a) 
Accordingly the above equations may be reduced to the type 
(ne — n2 [dn ne — n,*| dn, Qn, | /dn,\? ; 
0s Pe Lael} esas Lael — ae LC) 1 
n~— 7, d?n, dn, dn, 
+6 {oa lange] — ae Lat a |} 
ne — n,7| dn, 2n,? | dn, dn, 
+P Lael at Lar gl} Me 
It might reasonably be anticipated that terms involving the product of Af 


and (n,? — 7,?)/7,2 could be neglected in the first place and accordingly that we 


should have as approximate type 
Preiser meen dir, ie n° oy ns? | di; din, 
as Pas alts Lialge) [+ Ges Lag af Ih 
n.? { dn, dn, 
+ BLES laa TAN NOB 5 (26), 


but this approximation has not in every case numerically justified itself, and thus 
it cannot be invariably used as more than a reasonable starting-off point. 


(2) Fit of a Normal Curve. 


1 (x — m)? 


’ se ‘ N  [%n —= 
Differentiating 1, = —— Cire F208 dae: 
V20ro 


| Us 


i. ae 


vides! Suit 6 ty 


and then putting 
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anes vaio — ves 


we have on substituting the differentials in (2 a): 


3 Ns” 
; is 


n 


2 ne 
+ 6m (842 Dhateas — hetlh + 208 (FS [era — oP) 


1 


q 


i Xe (s {Fs 


zl} 


ne 
l= 244 ae Zs a We Ze — hee} = 2N S a L2s4a con Z-| [he s12st1 aa healt) 


n 2 
0= oS ei [hes 2s44 ad heel} 


2 ; 2 
+ Am (s {ee lies Zest 2g Ws 2s4a— nee} +2NS ie [2541 = Zs] [hsir%s1 me healt) 


+ AG (s ies 


nee 


(3412541 radi hz.) ay. (heii 2s44 ga hed) zis 2NS re el e534 hes?) 


the differential coefficients of y? being 


and 


d(x”) 
dm 


d(x?) _ 


da 


aN 


ay 
a 


~ «ah 


M4” 
S iF (heer. — heed} 


Illustration I. Fit of Normal Curve to Unilateral Data. 


Our first illustration treats a series of measurements by Bradley discussed 
by Bessel*. The mean of the observations is fixed, for in dealing with the observa- 
tions Bessel has added positive and negative variations together. 


* Emanuel Czuber, Theorie der Beobachtungsfehler, p. 192. 


TABLE I. Measurements of Right Ascension. 
: . Gaussian curve 
Limits Observed AAUSS LU CUTS improved by 
by moments minimum x2 
0”-0—0"-1 114 101-61 98-63 
0”-1—0”-2 84 84-12 82-59 
0”-2—0":3 53 57-65 57-91 
0”:-3—0"-4 24 32-71 34-00 
0”-4—0"-5 14 15-36 16-72 
0”-5—0”-6 6 5-974 6-881 
0”-6—0”-7 3 1-923 2:372 
0”:7-—0”:8 1 -5122 6843 
0”-8—0”-9 1 -1370 2053 


Fundamenta Astronomiae for the original data in order to remove the unilateral limitation. 


Search has been made in vain in the 
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@ was found equal to 2-282542 and the second formula of (3) gave the value 
2-341735 for o. As Ao was found so large that the approximation could not be 


d (x? 
expected to be very good, the following values of . ) were calculated from the 
Oo 
second formula of (4): 
d(x’) 
v do 
2-282542 — 32-53 
i 
= 12-3955 = 
43 325581 11-81 
5 = 2-380952 + 8-06 


By interpolation in this table o = 2-355860 was found as the value for which 

d (x?) 
do 

From x? the ‘goodness of fit’ P was found: 


equals zero, and this is the o of the improved Gaussian given above. 


l 
d (x?) 
x P | do 
Gaussian ae 10-833 0-211 — 32-53 
Impr. Gaussian 9-720 0-285 + 0-20 
| 


As will be seen the better fit is obtained by making o bigger than the Gaussian 
value, the improvement therefore cannot be looked upon as a correction for grouping. 
On the contrary the Sheppard correction would have given o = 2-264214 and 
have raised y? to 11:52. Thus we see that although the two methods give close 
values for P, the ‘better value’ is obtained as it should be from the lesser value 
of d(x?)/do. 


(3) Illustration II. Fit of a Normal Curve to Bilateral Data. 


For the next illustration I have used a table giving frequencies of cephalic 
index in Bavarian skulls*. Both o and m have here been varied. As the formulae 
(3) are somewhat laborious to work with, the approximations were used roughly 
suggested by the process on p. 264, but the results were not satisfactory ¢ and these 


approximate results are therefore not given here. But g ox x") and ye x’) for the two 


* J. Ranke, Beitrdge zur physischen Antropologie der Baiern, Miinchen, 1883. The table includes 
the material from Tables I-VI and VIII-X inclusive which may be treated as typically ‘ Alt-Baierisch.’ 
d(x?) 


+ In fact the calculation of the exact value of oe showed that the part of it neglected in 


formula (26) was about ;!; of the whole value. It essentially arose from the one tail group, this being 


200 of the whole neglected part. As a use 
& 


for this group was only as big as 1:0348, the approximate 


formula (2b) cannot be expected to be of great value for the normal curve. 
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Gaussians found in this way were used for interpolation purposes and their constants 
are therefore given in the following table under (6) and (c). By (a) is indicated 
the Gaussian from which we started, namely that found by moments, Sheppard’s 
correction being used. 


2 2 
Assuming . and a) to be linear functions of o and m, we determined 


from the cases (a), (b) and (c) values of o and m, given under (d), so as to make 
the differential coefficients zero. In the same way we found at last from the cases 
(a), (b) and (d) the constants of the Gaussian (e), the constants of which will be 
found in the following table. As will be seen we have succeeded in bringing the 


d(x") ang 2{x”) 


values of ar as near to zero, certainly close enough for all practical 
purposes. 
TABLE II. 
d 2 d 2 
(a) 83-06889 3°431833 10-205 *895 — ‘57 + 14-42 
(b) 83-01498 3°358380 10-301 891 — 10-56 — 9:97 
(c) 82-98832 3°331365 11-048 854 — 15:89 — 20-76 
(d) 83-05329 3°349421 10-108 -899 — 4-59 — 12-10 
(e) 83-07774 3°385991 9-858 -909 + 07 + <7] 
TABLE III. 
Gaussian Gaussian. 
Observed curve by improved by 
moments minimum x? 

75 and under 9-5 12-3387 11-3504 

76 12-5 12-6842 12-0767 

77 17 22-0702 21-3463 

78 B7/ 35-2942 34-6005 

79 55 51-8794 51-4323 

80 71:5 70-0925 70-1100 

&1 82 87-0421 87-6432 

&2 116 99-3519 100-4734 

83 98 104-2329 105-6275 

84 ‘107 100-5128 101-8352 

85 82 89-0879 90-0352 

S86 74 72-5781 72-9998 

8&7 58 54-3468 54-2778 

&8 34:5 37-4049 37-0099 

8&9 19 23-6625 23-1422 

90 10 13-7588 13-2703 

91 8 7:3532 6-9782 

92 and over 9 6-3093 5:7910 


(4) Fit of a Porson Linut to the Binomial. For a Poisson limit with the 
Saas 


ms 
“i we find 


e 
general term 
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and putting m = m+ Am, 


Starting with m equal to the mean of the observations I have found the 
improved values in the following two numerical examples. 

Illustration III. The first table given by L. Whitaker* contains the number 
of deaths per day of women over 85 years, published in the Times newspaper 
during the years 1910-1912. 


TABLE IV. 
Number : Poisson 
of deaths Observed Poisson by first improved by 
per day moment minimum 2 
0 364 336-250 331-1383 
1 376 397-302 396-334 
2 218 234-720 237-186 
3 89 92-446 94-630 
4 38) 27-308 28-316 
5 13 6-4532 6-7782 
6 2 1-2708 1-3521 
7 1 0:2508 0-2715 
d (x? ; we 
The m, x?, P and oe ) calculated from (5) were determined for the two distzi- 
butions as given in Table V. 
TABLE V. 
Le Sac 
= | Sad 
Poisson ... sisi 1-181569 15-226 0332 | — 35-61 
Poisson ISOS 1-196903 | 14-943 0361 | — 0-75 


Illustration IV. As our second illustration we have taken a table of phagocytic 
frequencies published by Major McKendrick f- 


TABLE VI. 
No. : Poisson 
of Observed pe OeON by A improved by 
Deaths Tet momen minimum x? 
0 620 605-924. 600-676 
1 282 303-568 306-164 
2 79 76-044. 78-026 
3 16 12-699 13-257 
4 2 1-5906 1-6892 
5 1 “1738 -1881 


* Biometrika, Vol. x. p. 67. . 
+ Proceedings of the London Mathematical Society, Vol. x1. 1913, p. 401. 
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The numerical values of the constants of the series and of the ‘goodness of 
fit’ are 


TABLE VII. 
d 2 
m | a P oa 
Poisson ... is -501000 6-865 231 — 41-86 
| Poisson improved -509700 | 6-672 246 - 1-21 


This table is of interest because it illustrates the apparent paradox, already seen 
in the case of the second Gaussian curve illustration, that oo mean’ is not neces- 
sarily the ‘best value’ of the constant termed the ‘mean.’ 


(5) et of a Binomial to Binomial Data. 


Let n, be equal to the (s+ 1)th term of the binomial (p+ q)', where 
p+q= 1, or to 
Ld—1)...@@—-s+1 
poopy Da 0 + 0. 


we then find 
dn, l—pl—s _ m—s 


=e Ns , 
dp "*p(1—p) pil— p) 


where m is the mean or stand for / (1 — p), 


dn, Ns i 2 : 9) 
Ge OSI Nae Pla = Sa PLS) (2p) — lp (py 


Oe Ate —s)2 — s) (1 — 2p) + mp 
BE — py {lm — 8)? + (m — 8) (1 — 2p) + mp}, 
oe (i Deere Ne cee h ) 
Fem fs (log. +7 +75 Pe gly eer oT 
dn, _ (10 ye Lis a 1 2 ] 1 
CLE SUA 2A a a Saar ea i) Ce is (=r 


= dim — 8) (loge +5 + pt tpg) + 
dp pa—p)\” eee areca) 2): 
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and the equations (2a) take the form 


nee 8 Hig 
Sg aay & Geman 8)? + (m — 8) (1 — 2p) 4 mpl) ap 


18( | 5) (logep+ F+ pate +g ~ )- (1—p)]) al 

s (3 [lowe + 4 aan tp i]) (2), 
=8(s0=p|@- (lopt zt at tiar) a »)|) An 
+s(7= (lop p+ i %j pore an) tet apt tase |) 


while the approximate formulae of the type (2b) are 
S (2 wt =<) a5 Gey [m sp) Ap 
ms p(l—p)/ ~ ae apy er ees 
cae aut bare eee ae 
| 8 (aati =) lim #) RPT eg Le hae see tae 


Nn 1 1 1 
asi z pa ens 9). 
S- [lower + 7+ py attri) (9) 


Ne 


| reg | l 
SOS ( trial +t ey 
28 Gan = p) Hm s) (loge P T ] I fai > ooo SP ape ))) Ap 


(Ms oneal eel 
+ 25 (T= = [lowe e+ j Fee ral)” 


Illustration V. Weldon’s Dice Data. 


For illustration are used the following data due to the late Professor W. F. R. 
Weldon*. They give the observed frequencies of dice with five or six points when 
a throw of twelve dice was made 26306 times. 


TABLE VIII. 


Number of dice Observed. | Binomial by Improved binomial Improved binomial 
in cast with 5 fected method of (a) by x? a (6) by x? a 
or 6 points 1 y moments mininum minimum 
0 185 189-679 190-651 190-659 
1 1149 1154-441 1157-607 1157-600 
2 3265 3223-426 3226-085 3225-959 

3 5475 5461-01 5458-07 5457-78 

4 6114 6253-64 6245-98 6245-71 

is) 5194 5101-31 5095-82 5095-79 

6 3067 3041-04 3041-47 3041-69 

id 1331 1335-82 1339-55 1339-81 

8 403 429-627 432-815 432-984 

9 105 98-865 100-351 100-419 
10 14 15-5133 15-9413 15-9595 
11 4 1-57640 1-65879 1-66210 


* Phil. Mag. July, 1900, p. 167. 
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Fitting the frequencies from the end by means of two moments we obtain the 
binomial 
(6658208 + °3341792)12-126879, 
the terms of which are given in the table above under the head Binomial. 
From these starting values of p and / we found by the equations (8) the constants 
of the improved binomial (a) p = -6674922 and 1 = 12-188945. 
A comparison between the coefficients of the two sets of formulae (8) and (9) 
gave the result that they only diverged by between 1-4 and 5 per mille of their 
hae) 
value. As cZsiwowlist for the tail group was as big as 5-44, we are from this justified 
Ss 
in expecting the approximate formulae (9) to be useful for binomial data. 
Starting from the improved binomial (a) another improved binomial (b) was 
found by means of the formulae (9). As will be seen I only succeeded in 
2 2 , 
diminishing He by raising 208 , and x” came out with exactly the same value 
as by the former formula. The constants for the improved binomial (b) are 
p = 6675432 and J = 12-191141. 
The constants illustrating the ‘goodness of fit’ were found as follows: 


TABLE IX. 
d(x?) d(x*) 
x" zs dp dl 
Binomial nae ae 11-648 “390 159-47 — 8-15 
Improved Binomial (qa) 11-513 401 26-02 _— 84 
5 (b) 11-513 ‘401 — -02 + 1-96 


It will be seen from the above illustrations that the probability of happening 
as determined by the y? test of ‘goodness of fit’ being a maximum can always be 
made somewhat greater than the same probability deduced from a fit by the 
method of moments, which at any rate for the Gaussian curve is usually assumed 
to be the ‘best.’ 

(6) On the ‘Best’ Values of the Constants of Regression Curves. 

If we apply the test of ‘ goodness of fit’ to regression curves as recently indicated 
by Pearson* modifying Slutsky’s methods, we shall experience the same divergence 
between the curves of regression found by the method of least squares and the 
curves calculated so as to make y? a minimum, as we found when dealing with 
frequency distributions. 

In the paper cited x? for a regression curve is given as 

as (Sette — Pel) Met Hee cuba sotets (10), 
on, 


* Biometrika, Vol. x1. pp. 239 et seq. 
+ Journal of the Royal Statistical Society, Vol. LXxvit. pp. 78-84. 
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where m is the mean of the pth array of the sample of size M from a population 
of size NV, while m, is the theoretical mean as found from the regression curve, 
Np, = Ny M/N, is the mean frequency and on, the mean standard deviation of the 
pth array in the samples. The difficulty in applying the ‘goodness of fit’ test 
lies in finding adequate values for n, and oj,. Let us assume them to be found. 
The ‘best’ values of the constants f,, f,, ... of the regression curve, i.e. the values 
which make x? a minimum, will then be found from equations of the type 


a ee 
0 ag {-"2 (mm, — m,) om A mM»)? | 11 
= h ae Me My) df, + (M,— My) Ta Biase (Guin): 


As will be seen these equations fall into the equations resulting from using 


the method of least squares if =. is independent of the constants of the regression 
oO Np 

curve and at the same time for the different arrays proportional to the n, of the 

sample. Even if our sample be derived from truly Gaussian data, these conditions 

will only approximately be satisfied, the oj,, although constant, being dependent 

upon the constants of the regression curve and the n, of the formula not being 


really the sample value. 


2 to be independent of the constants of the regression line 
orn 


M, =ax+b, the equations (11) take the form 
S {v, (m, — ax — b) x} = 0, 
S {v, (m, — ax — b)} = 0, 


Supposing 


when we put v, for 


From these equations we find 
S (vpmye) . S (Vp) — S (Upmy) S (x) 
S (Upv*) JS (Up) — tS (Upx)}? 
S (VpMy) _ e Se(U52) 
S (vp) VSG) « 
formulae agreeing with those derived from the method of least squares if v, equals 
the marginal frequencies of the sample. But not agreeing with them if, for example, 
the material be heteroscedastic. 


(7) Illustration VI. Auricular Height of School Girls. 


This example was first used by Pearson in the memoir on skew correlation* 
and later as an illustration of the test of ‘goodness of fit’ of regression curvesf. 


and v= 


* Drapers’ Company Research Memoirs, Biometric Series 11. p. 34. 
+ Biometrika, Vol. x1. p. 253. 
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For the present use theoretical values of n, and o,2 were determined, from which the 
values of v, given in Table X are calculated. Then, and v, of the table represent the 
weights given to the means of arrays respectively by the method of least squares and 
by our method of making y? a minimum. It will be seen that our method throws 


TABLE X. 
ee Vp me | f an oe f or ee th 
“0 observed : observed a See squares 
= — = a | — ——— — — 
3—4 1 5:3790 115-25 | 117-76 117-95 
4—5 i 13-7170 116-96 118-44 118-61 
5—6 18 28-5973 117-47 119-13 119-27 
6—7 40 56-0527 119-10 119-81 119-94 
7—8 76 95-3828 120-30 120-49 120-60 
8—9 125 146-023 121-63 121-17 121-26 
9—10 177 199-783 121-72 121-86 121-92 
10—11 235 243-414 122-82 122-54 122-59 
= =12 261 271-704. 123-14 123-22 123-25 
12—13 309 277-232 123-89 123-90 123-91 
13—14 263 259-386 124-86 124-59 124-58 
El) 198 223-505 125-71 125-27 125-24 
15—16 214 172-851 126-16 125-95 125-90 
16—17 162 121-965 126-53 126-63 126-57 
17—18 95 75-7303 126-91 127-32 127-23 
18—19 61 43-0926 127-02 128-00 127-89 | 
1920 13 21-2448 129-56 128-68 128-55 | 
RO 7 8-09110 123-82 129-36 129-22 
2i—22 8 6:42326 126-50 130-05 129-88 | 
5} 2 2-42653 125-25 130-73 130-54 | 


the weight more to the first half part of the groups of ages than the method of 
least squares. This is due to the heteroscedasticity of the material, the o%;, 
varying from 27-2776 in the youngest group to 60-4676 in the eldest. The two 
last columns of Table X contain the m, calculated from our regression formula 
and from the usual formula; as might be expected our m,’s are closer to the 
means of the observations for the younger groups of ages and differ more for the 
higher ages than do the m, values obtained by the method of least squares. The 
x? calculated by (10) are for the two cases 18-45 and 18-67 and we have only raised 
the ‘goodness of fit’ P from ‘543 to ‘558 although the weighting in the two 
methods appeared sensibly different. 
The usual regression line is 

M, = 124-0467 + -662979 (x, — 12-7007), 
124-0467 and 12-7007 being the general means, and regression line from the ? 
formula may be written 

M, = 124-0411 + -682455 (x, — 12-7007) 
from which is seen that it passes not far from the mean. 


In a similar way I have treated the regression of ages on height of head. Also 
I have here calculated the heteroscedasticity and have had to use a parabola to 
Biometrika x1 18 
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Age in Years 


Dracram I. Comparison of Regression Straight Lines found by method of 
Least Squares and by x? Test. 
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Dracram II. Scedastic Curve of Age on Auricular Height in Girls. 
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represent o%;,, the squared standard deviation of the arrays of same height, to 
obtain a reasonable description; this is shown on the diagram. The marginal 
frequencies of the height variate could be expressed fairly well by a Gaussian curve. 
These theoretical values of 0°; and n,' are given in Table XI together with the 
weights 


y= x = 
GH 
o Ny! 
calculated from them. 
TABLE XI. 
ter oH! Ny i Ny’ My ip Mey dora 
Millims theoretical | theoretical v bbacrved observed eons x ae 
102-25—104:25 8-456 4:73 4-:7123 2 5:00 9:92 9:99 
104:25—106-25 8-748 6-62 6:3809 10 10-40 10:18 10-25 
106:25—108-25 8:987 13:89 13-0282 10 11-10 10-45 10-51 
108-:25—110-25 9-172 26:80 24-6339 27 11-54 10-72 10:77 
110-25—112-25 9-304 47-59 43:°1217 56 11-71 10:99 11-03 
112-25—114-25 9-382 77°76 69-8648 59 11-81 11-26 11-29 
114-25—116-25 9-408 116-90 104-750 115 11-62 11-53 11-55 
116-25—118-25 9-380 161-71 145-332 142 11-70 11-80 11-81 
118-25—120-25 9-298 205°83 186-597 244. 11-80 | 12-06 12-08 
120-25—122-25 9-164 241-06 221-744. 265 12-15 12:33 12-34 
122-25—124-25 8-976 259-78 243-960 261 12-52 12-60 12-60 
124-25—126-25 8-735 257-59 248-580 265 12-83 12-87 12-86 
126-25—128-25 8-441 235-02 234-710 219 12-98 13-14 13-12 
128-25—130-25 8-093 197-30 205-508 197 13-78 13-41 13-38 
130-25—132-25 7-692 152-41 167-023 131 13-85 13-67 13-64 
132-25—134-25 7:238 108-33 126-167 88 13:78 13:94 13-90 
134:25—136-25 6-730 70:85 88-7361 7 14-28 14-21 14-16 
136:25—138-25 6-170 42-64 58-2529 52 14-40 14-48 14:42 
138-25—140-25 5-556 23-61 35-8204 20 14-05 14:75 14-69 
140-:25—142-25 4-888 12-03 20-7416 16 14-56 15-02 14-95 
142-25—144-25 4-168 5:64 11-4040 11 14:95 15-29 15-21 
144:25—146-25 3°394 2-43 6:0407 4 18-00 15-55 15:47 
146:25—148-25 2-567 1-49 4:8835 1 19-50 15:82 15-73 


The usual regression line is 
M, = 12-7007 + 130489 (Yp — 124-0467), 
and the line for which yx? is a minimum is 
My = 12-7071 + +1342345 (y, — 124-0467). 


For x? were found in the two cases the values 44-411 and 44-109 and for the 
‘goodness of fit? P the values (0047 and -0051*. 


* A case was purposely chosen in which the regression was known to be far from linear, in order to 
ascertain whether this fact itself would separate at all widely the least square and x? regression lines. 
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The intersection point of the two x? regression lines is m= 124-0453, 
m’ = 12-7070, which is seen to be very near to the general means. Introducing 
that point into the equations of the lines, they take the form 


My’ = 127070 + -1342345 (y, — 124-0453), 
My = 124-0453 + -682455 (av, — 127070). 


From the slopes of the lines we find the value -3027 for the correlation coefficient, 
whereas the method of least squares gives the value -2941. 


Although we have found the material to be decidedly heteroscedastic and the 
weighting of the two series of means rather different from that of the marginal 
frequencies, we nevertheless see that the resulting regression lines differ very little 
from the ordinary regression lines, both the deviations of the means and the 
correlation coefficient derived from them being less than their probable errors. 


(8) The conclusions to be drawn from the present investigation are: 


(i) The definition of ‘best,’ which leads to the method of moments being con- 
sidered ‘best’ and incidentally to the method of least squares being considered 
‘best,’ is undoubtedly somewhat arbitrary. If we use Pearson’s ‘Goodness of Fit’ 
test, then the method of moments is not necessarily the ‘hest,’ the best value of 
the constant termed the mean is not necessarily the mean, nor generally the best 
value of the correlation coefficient between two variates that calculated by the 
moments and product moment method. 


(ii) On the other hand the present numerical illustrations appear to indicate 
that but little practical advantage is gained by a great deal of additional labour, 
the values of P are only slightly raised—probably always within their range of 
probable error. In other words the investigation justifies the method of moments 
as giving excellent values of the constants with nearly the maximum value of P or 
it justifies the use of the method of moments, if the definition of ‘best’ by which 
that method is reached must at least be considered somewhat arbitrary. 


The present paper was worked out in the Biometric Laboratory and I have 
to thank Professor Pearson for his aid throughout the work. 


MISCELLANEA. 


Note on the Standard Deviations of Samples of Two or Three. 


By ANDREW W. YOUNG, M.A. 


In an “ Editorial” contained in Vol. x of Biometrika*, there is a discussion of the distribution 
of the values of the standard deviation of a population which are deduced from small samples of 
the population. It is there shown how the distribution approaches normality as the number, 7, 
in the sample increases, a table of the characteristic constants of the frequency curves for various 
values of n being given. The smallest sample considered is that of n = 4, but samples of two and 
three are of occasional occurrence especially in physical work and now and again a value of the 
probable error of an experimental result is deduced from a set of two or of three observations. 
A knowledge of the theoretical distribution of the standard deviations for such small samples 
will give us some idea of the reliability of this procedure and it is the object of this note to supply 
this omission from the former paper. 


“Student’s” formula for the distribution of samples of standard deviation is 


= YoS"—2e 2 
where o is the standard deviation of the whole population and & is the standard deviation given 
by a sample of size n. Thus the distribution for samples of two is 


32 
Y= ten 
extending from 3 = 0 to 5 = o, ie. the distribution is simply half of a normal curve; and the 
distribution for samples of three is 
y =yode 2%, 
also, of course, extending from = = 0 to >= om. 
It is easy to find by direct integration the moment coefficients of these curves. 


Case of Samples of Two. 
If we denote by N the total number of samples in the assumed distribution, 
N=% | 
Taking the first four moment coefficients about the origin to be jay’, po’, 3’, #4’, and the moment 


coefficients about the mean to be, as usual, py, ws, fa, We have 
>2 


32 
2 mee Nor 
‘ é ds = Yo SGT o 
) a 


co eal 2 
py’ = Mean value of 3=3 = ” | se ds = ae = _ = -5642c. 
Jo 7 
Ss 
, —_ Yo See ee Yo Nw oF 
= > oF — a 
Me ie Sees reicas So, 
1 ] 
pee nO ay ee oe ee 2 
giving: po =o G = 1817o?, 


and the standard deviation of 5 = os = -4263¢. 


* “On the Distribution of the Standard Deviations of Small Samples.” Appendix I to papers by 
“Student” and R. A. Fisher. Biometrika, Vol. x. p. 522, 1915. 
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Pon Ui ae a Of Ge a? 
2) steht 
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ae a eee II ‘ 
giving: Ps = Rie (¢ = 5) = ‘0771c°. 
Yo [oa 738 Yo B0°N 
, 0 ap} oO 7 
Gs ste &gs — Yo Se 
ba I. é dS N 8 a0o ; 
giving: fhe SO é + Bee EL = ‘127704. 
4 fr 72 


From these we derive 
B, = °9906, 2 = 3°8692. 


Case of Samples of Three. 
In the same way 


ig Bor 
[pe 3° 
9) ‘ e 

giving: po = 0° ie = 3) = -1431o?; 
and Oy = 3782 

row = ao ae 
ene pg = 0? RE i q r 1) = -03420°, 

pa’ = $04, 

Ses 8 2 

giving: a= oe G - oy = 066404, 
and B, = 8988, 8» = 3'2451. 


Modal Values. 

In the case of n = 2, the mode of the theoretical curve is at the origin 5 = 0, but it is to be 
borne in mind that it is the areas of strips of the frequency-curve which are to be used to estimate 
the probability. In practice, therefore, seeing that all measurements must be made in discrete 
amounts and cannot be mathematically continuous, we can only assert that the most frequently 
occurring values of = are those which are nearest zero—not actually zero. The example given 
below will make this clear. 


For n = 3, the mode is obtained by differentiating the equation 


This gives, for the mode, 


thus the modal value § of 3 is given by 


Skewness. 
The skewness of the distributions is for n = 2, 1-3236, and for n = 3, -3867. 
Thus whether we consider the mean or the modal values of the distributions, it is evident that 
the probable error determined from a set of three observations is very untrustworthy and that when 
there are only two observations it is very much worse. 
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With the preceding calculations we can now complete the Table on p. 529 of Biometrika, 
Vol. x, with the following 
TABLE I. 


Table of Values of the Constants of the Frequency Distributions of the Standard 
Deviations of Samples of Size 2 and 3 drawn at random from a Normal 
Population. 


Standard Deviation | Measures of Deviation from Normality 


Size of | Mode | Mean 


eamiple) “Seale 2/2 Gslo | oy/(o/,/2n) | Skewness | (Si | Bo 
| —— = 
2 | 0-0 5642 | +4263 8525 1-3236 | 9906 | 3-8692 
3 | ‘5774 | 7236 | -3782 9265 3867 | 3983 3-2451 


Experimental Verification for the Case of Samples of Two. ; 
The frequency distribution of the number of stigmatic bands on the capsules of a growth of 
Shirley poppies is as follows*: 


TABLE II. 


No. of Stigmatic 
Bands soe |) ob eh ez 8 OO) 2A S45) 5 
1 


16) 17 | 18 | 19 | Total 
No. of Capsules 11 | 32 | 56 | 148 | 363 | 628 | 925 | 954 | 709 | 397 | 155| 51 


12 1 | 4448 


the standard deviation of the distribution, as calculated by the ordinary method, being 1-8977. 


We will examine the frequency distribution of the values of the standard deviation of this 
series which are given by samples of two capsules taken at random. Now the standard deviation 
of two measurements 2, and 2, is easily shown to be 7 taken with the plus sign, so that 
in this case, the variate being measurable only in units, the possible standard deviations will 
be 0, -5, 1, 1-5, ..., and we can find the theoretical frequencies (column (b) of Table III) of these 
values by taking the areas of the strips of the half-Gaussian 

2N -(——- : 
y= 18977 Va e (a aa) f 
whose bounding ordinates cut the axis of 5 at 0, -25; -25, -75; -75, 1-25; ..... Thus the breadth 
of the first strip is only half that of the others and it will be found that it is the value -5 which is 
the real mode of the probability. 

In the case of such a distribution we can find the frequencies of the standard deviations of 
samples of two by actual calculation. For if we denote the chance of a capsule occurring with 
5, 6, 7, 8, 9, ... stigmatic bands by a, 0, c, d, e, ... respectively, it is clear that the chance of the 
value 0 of the standard deviation occurring is a? + b? + c? + d?+ ..., and the chance of the value -5 
occurring is the chance of a sample of two flowers whose numbers of stigmatic bands differ by 1, 
ie. is 

ab+b(a+c)+ce(b4+d)+...=2(ab + be + cd + ...), 
and similarly for 5 = -1 the chance is 


ac+ bd+c(a+e)+d(b+f)+...=2(ac+ bd + ce + ...), 
and so on. 


By this means the theoretical frequencies given in column (a) of Table III were calculated. A 
histogram of these values will be found to be well in accord with the half-Gaussian given above. 


* Pearson, Phil. Trans. Vol. 197 A, p. 314, Hampden Series. 
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With a view to confirming the theoretical results, a practical random sampling was made in 
the following way. A circle, about 20 inches in diameter, was carefully divided into sectors 
whose angles (and therefore areas) were proportional to the class-frequencies given in Table II. 
For each sampling a pointer was placed at random inside the circle and the number of the sector 
in which it was placed was noted. It appeared that, if the observer kept his eyes shut and 
rotated the circle for a time between each pointing, the method wouid give quite satisfactorily 
random sampling. In this way an ordered series of 648 samples was made and by taking samples 
of two by pairing in accordance with three different rules, the series of numbers given in Table III 
were obtained. Alongside of each series in the Table is given the ratio of the number in each 
set to the total. These ratios are to be compared with the “theoretical chances.” In all three 
samples the number with standard deviation 0 is considerably less than the theoretical numbers, 
but from the values of the Goodness of Fit “P” given at the ends of the columns it appears that this 
can be accounted for by the variations of random sampling. 


TABLE III. 
Frequency distribution of Standard Deviations of Samples of Two 


Capsules from the Shirley Poppy Series. 


Experimental Results 
Standard Theoretical I II Ii 
Deviation Chance = E - 
Number of Number of Number of 
(a) (b) samples Chance samples Chance aataples Chance 
0 | -tee2 | -1478 39 1204 S800 aire 37 1142 
5) +2820 2760 102 3148 99 | +3056 109 +3364 
1 +2249 2246 75 -2315 75 -2315 74 +2284 
1:5 +1553 -1594 55 -1698 58 -1790 44 -1358 
2-0 -0937 -0986 26 -0802 26 -0802 31 -0957 
2:5 -0503 -0532 7 “0216 15 -0463 15 0463 
3-0 -0243 -0250 12 ‘0370 5 -0154 8 +0247 
3°5 -O108 “0102 5 ‘0154 5 “0154 5 “0154 
4-0 0043 -0037 3 -0093 2 -0062 0 -0000 
4-5 ‘0015 -0011 0 “0000 1 -0031 1 | :0031 
5-0 -0005 -0003 0 -0000 0 -0000 0 | -0000 
5:5 -0001 -0001 0 ‘0000 0 -0000 0 -0000 
Using Theoretical Chance (a) =150, P=18 | 2=83, P=-69 y2=105, P=-49 
Using Theoretical Chance (0) 2=16-6, P=-12 7=8:9, P=-63 x=11:3, P=-42 


The mean standard deviations given by these sets are for I, 1-06, for II, 1-07, for IIT, 1-05, 


while those derived from the half-Gaussian and from the “theoretical chances” are 1-071 and 
1-050 respectively. It will be noticed also that although the calculated chances (a) appear to 
give a better fit than the half-Gaussian, the differences in the ‘ P’s” are very slight, and the 
half-Gaussian would suffice for most purposes. These results form a strong confirmation of the 


correctness of the theory. 
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W. R. MACDONELL. 


Born October 16, 1852. Died May 15, 1916. 


For the third time the hand of death has fallen heavily on this Journal. 
Dr W. RB. Macdonell died on May 15 at the relatively early age of 63. On his retire- 
ment from business in 1899, at the suggestion of W. H. Macaulay he joined the 
Biometric Laboratory to do research work, and his patient labour, his wise counsel 
and lovable disposition soon made him an essential part of the place. It would 
be difficult to appraise adequately the help he gave to the Biometric School in its 
early days. It was not only in material and apparatus, it was not only in resourceful 
suggestion to his fellow-workers, but it was especially in the general sense of courage 
and in the spirit of readiness to undertake the tedious because it meant profit 
to science in the future, which he diffused around him that his help was so invaluable. 
And there was need of the heartening which Macdonell gave! The first greeting 
that Fawcett’s paper on the Naqada crania, involving years of work, received from 
anthropologist and anatomist was anything but favourable, but Macdonell per- 
sisted on the lines thus laid down in his great memoirs on the 17th century English 
crania, and it would have gladdened him to have seen Fawcett’s memoir now 
described by a distinguished anthropologist as Fawcett’s “classical memoir,” 
and the anatomical author of the most recent British cranial research stating that 
he “cannot do better than follow the scheme adopted by Fawcett in her study of 
the Naqada crania and also by Macdonell in his study of the Whitechapel English 
crania and other series of English skulls.” It takes a long while to reform any 
branch of science, but when the history of craniometry comes to be written, those 
early workers in the Biometric Laboratory will be remembered, and not the least 
Macdonell, who gave heart to them all. 


When Biometrika was founded Macdonell joined the little group of men who 
provided the guarantee fund—there is small harm now in mentioning their names: 
Francis Galton, W. F. R. Weldon, W. R. Macdonell, Lord Parker of Waddington, 
and the present Editor—and the aid then provided carried this Journal through 
the troublous days of infancy to the completely self-supporting stage which 
preceded the war. It is a sad element in the fate of this Journal that it has to 
meet a new crisis in its existence with many new friends it is true, but without the 
majority of the old supporters. From the first issue until this very year, Macdonell 
acted as assistant editor of Biometrika, and his name appeared on the title page 
as such for several years after the death of Weldon. Our readers were, however, 
very little conscious of all the labour Macdonell put into proof-reading and many 
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a contributor has profited in a way he little realised by Macdonell’s assiduity 
and not infrequently by his suggestions. Scotland has preserved its knowledge 
and appreciation of grammar long after grammar has been discarded in England, 
and few abler proof-readers can be found than a Scotsman trained in Oxford, 
especially if he has graduated in science, and tempered his science with modern 
European literature as a hobby. The width of Macdonell’s studies and of his 
interests was effectively demonstrated by his memoir “On the Expectation of 
Life in Ancient Rome and in the Provinces of Hispania and Lusitania and Africa.” 
Few men could have been found to combine the necessary biometric with the 
still more needful literary training requisite for a study of this character, and fewer 
still would have concluded it with such words of modesty as Macdonell did. 
With his death biometry loses a sturdy champion, our subscribers and contri- 
butors more than they yet realise, and some of us a close and most trustworthy 
friend. Not least will this be recognised by the little band of pupils he formed 
around him during his occupancy of the lectureship on biometry in the University 
of Aberdeen. He acted straightly and he advised wisely both in his mercantile 
and in his scientific career. He had the reticence which flows from strength, 
and the persistency and courage of the strong who are reticent. Life seemed 
a spacious thing in his handling. What more could we have desired for him? 


We owe to the kindness of Professor W. Paton Ker the following brief notes 
on Macdonell which, emphasising the width of his interests and his genial personality, 
will enable those of our readers who only know him from his memoirs in Beometrika 
to understand better another of the early members of the Biometric School. 


I first met Macdonell in my freshman’s term at Balliol: I succeeded him in 
rooms possibly the worst in College, and he came in a few days later to ask if I 
had seen his copy of Victor Hugo’s poems lying about. This I had already found 
and now restored to him; it provided something to talk about afterwards. He 
was a Mathematical Scholar of Balliol and did not neglect his subject, but he 
always seemed more interested in other things, especially Greek. He had been at 
Aberdeen along with R. A. Neil of Pembroke, Cambridge, always one of his greatest 
friends; at Balliol he found another Greek scholar, William Gunion Rutherford. 
Rutherford and he used to read Greek together for a time. Then Macdonell 
left Oxford to go into business, and for a year or two I saw nothing of him. 
But I found him in London when I went there in 1879 after taking my degree; 
Rutherford was there also, a master at St Paul’s School, and we made it a regular 
thing to meet on Saturday evenings, the three of us, at Rutherford’s rooms in 
Mitre Court. It was a good time while it lasted, but soon we went different ways 
—Macdonell to India—and I lost sight of him and did not hear much, for several 
vears. Soon after he got to Bombay he wrote to me, asking for the Journal of 
Philology to be sent to him; he was keeping up his Greek and Latin; he was 
also amused at the manners of scholars in disputation. At Bombay he was once 
asked by the Danish merchants there to choose for himself a present which they 
wished to give him in acknowledgement of services rendered by him in connexion 
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with their business. He chose, as the best thing obtainable from Copenhagen, 
Madvig’s edition of Cicero De Finibus, and thought that they were pleased with 
his choice. 


Macdonell was chairman of the Chamber of Commerce in Bombay, and 
additional member of Council there, 1893-1895. Then he came home, and for 
some time worked in the London office of his firm. He retired from business in 
1899. He had carried out his plans, and earned his leisure. Probably few men 
are to be trusted with leisure at the age of 47, but Macdonell made the most of his 
life. He never lost interest in any study that once had engaged his attention, 
though he had so many interests—in books, pictures, music and travelling— 
that naturally he could not always attend to everything. He never forgot a friend. 
He had been six years married when he came home from India, and the friend 
that was made welcome by his wife, and his children, as well as himself in his 
house at Enfield has much to thank him for. He made new friends as he went on, 
through common interests for the most part, and particularly through the investi- 
gations in which he took part along with Weldon and Pearson. He felt very 
deeply the loss of friends; he could scarcely bear to speak of the death of Neil 
of Pembroke. 


He inherited his love of books; one of his early recollections was the farm- 
house in the Highlands where it was the custom to read aloud in the evenings, 
usually from some of the English classics but sometimes Don Quixote to the 
farm-servants; light being provided for the reader from a stock of pine splinters, 
lighted in succession one from another: one of the party was employed in cutting 
these brands, and a boy held up the light over the book, and renewed it from the 
supply that was passed to him. Macdonell’s people were Catholics, and his study 
of the humanities may have owed something to the traditions of the Church. 
Among his friends was Monsignor Fraser, Rector of the Scots College in Rome. 
I remember Macdonell bringing back from a visit to Rome a story of a pibroch 
heard at night there, and how the music was traced to the roof of the College— 
a story that would have delighted Sir Walter Scott. 


No weakness of the human race gave Macdonell more amusement than the 
vanity of biographers, and I can almost see him looking over these notes of mine, 
and making disrespectful remarks about them. But I am glad to have been 
allowed to write them, such as they are. 


W.-P. Kur. 


UNIVERSITY CoLLEeGE, Lonpon. 
3 July 1916. 
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FURTHER SUPPLEMENTARY TABLES FOR DE- 
TERMINING HIGH CORRELATIONS FROM 
TETRACHORIC GROUPINGS. 


By ALICE LEE, D.Sc. 


Tue difficulty of determining correlations between -80 and 1-00 by the tetra- 
choric method, owing to the slow convergency of the terms of the fundamental 
equation for ‘tetrachoric r’ has long been recognised. In 1912 Everitt* published 
‘Supplementary Tables for Determining High Correlations from Tetrachoric 
Groupings.” These tables much simplified the work within the field in which it is 
really possible to determine accurately a high correlation—beyond certain values 
of ‘h and k’ such determination is impossible owing to the influence of random 
sampling on a quadrant category which in most practical cases will only contain 
an isolated unit or two. Everitt’s tables covered the values of r from + -80 to 
+ 1-00 for values of the dichotomic planes given by f and k varying from + -0 
to+ 2-6. They admitted at once of our dealing with those cases of negative values 
of r, for which either h or k was negative, but not with cases in which 7 was 
negative and both h and k/ remained of the same sign. The present tables provide 
for this omitted portion of the possible field and thus complete Everitt’s work. 

I have followed his method of quadrature in evaluating my integrals. But 
I have preserved more decimal places than he has done, partly because my 
significant figures are thrown into higher decimal places than his by the nature 
of the case, and partly because recent experience in other fields has shown workers 
in the Biometric Laboratory, that tables are often of service for purposes other 
than those for which they were originally calculated, and that it is worth while 
preserving every reliable figure. I think my results are always correct to six 
figures and generally to the actual number tabulated. While many of the entries 
would only be of service in the case of total populations approaching the magnitude 
of census populations, their indication of the high decimal place in which the 
first significant figure can occur in the ‘d’ quadrant, will help to dispel the illusion 
that absence of frequency in a quadrant is necessarily indicative of absolute 
association, when the table is based on a limited sample. A further reason for 
the number of figures I have preserved may be found in what I believe to be one 
of the principal uses of tables of this type. They are not only valuable for the 
calculation of the correlation in the case of given data, but also in many theoretical 
investigations where the correlation is supposed known a priori and we require 


* Biometrika, Vol. vin. p. 385. 
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to ascertain how many individuals may be expected to possess a combination of 
characters each of which exceeds a given value. I have adopted throughout 
Pearson’s original notation in his memoir on the fourfold table*, i.e. d is the 
frequency of the quadrant diagonally opposed to that in which the mean falls, i.e. : 


ady, 


ee eer area 
os ea 

where in our case r is given values from — -8 to — 1-0. This is the integral I have 

tabled. For the case of r = — 1-0, d/N will be zero for all values of h and k, so 

that it is not needful to provide a table for r = — 1-0 and interpolation between 

— -95 and — 1-0 must be between the values given in the table for — -95 and 

zero values. 


hJk 


Illustration I. An appreciation of sex was made by two different observers 
on 216 femora. It is required to find a measure of the association in judgment 
between the two observers. 

In appreciating sex by the examination of a bone the observers opinion varies 
from practical certainty of maleness through every shade of doubt to practical 
certainty of femaleness. The strength of the judgment is therefore a continuous 
character, although the actual sex forms a rigid categorical differentiation. It is 
with the judgment, and not with the actual sex that we are here concerned, and 
we have simplified those judgments down to unique categories 3 and 9 classifying 
under each all queried values. 


The table is: 
First observer 


EO 3 | @ | Totals | 

>! . z 

5. | li | 

na 98 }i 16 114 | 

S eet reine Seen eI. NEN ER tad 3 a Mean 

oO | | i 

=a 6 96 102 

rs) 

3 Ve 

ZW) Totals 104 sd 216 
Mean 


we see that h and k will now both be positive and 7 will be negative. 
* Phil. Trans. Vol. 195, A, p. 1. 
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We have d/N = 6/216 = -02778, 
402 — 3 (1 —a,) = -47222, 494 — 2 (1 — ay) = 48148, 
or h = -06969, k = -04644. 
From the Tables for 7 = — -95 


| — “06° ry 
|, hence : E eo} d = 038,334, 


ane as 033,024) 
k=") N , hence h = -06969) d 
: — = -024,169. 
h=l) a _ poo 318 | ogy bi 
Te 2 11) NG Oe mg tae, 


From the last column on right we deduce: 


d 
i. 7H 
N 031,756. 


For the given values of h and k we have accordingly: 


h = -06969, k= -04644, 


r=100, 1r=—-95, 
dN 10) d/N = -031,756. 
Thus for d/N = :02778, we have r = — -9562. 
Accordingly for the table returned to its original form, we conclude that the 


association between judgments of sex made by two independent and competent 
observers from the femur is measured by a tetrachoric correlation of + -956. 


Illustration II. The following data are for the French long bones in the male, 
maximum lengths: 
Femur: Mean = 452-28 mm. Humerus: Mean = 330-10 mm. 
Standard deviation = 23-72 mm. Standard deviation = 15:38 mm. 
Correlation of femur and humerus = -8421. 


(a) Find the percentage of cases in which a humerus of under 300 mm. will 
be combined in the same individual with a femur of over 480 mm. 


Here = fA = 27-72/23-72 = 1-16863; & = — 30-10/15-38 = — 1-95709. 
Since / is positive, k negative and r positive, we must replace our system by 
h = 1168638, k= 1:95709, and +r = — -8421. 
Our tables for r = — -80 and — -85 show that for the given values of h and 
k the required frequency would probably be less than 1 in 50,000,000. We may 


conclude therefore that no such individuals would occur in the total French male 


population. This is a result whose order would hardly be appreciated without 
an examination of the present tables. 


ALick LEE 


(6) Find the percentage of cases in which a humerus of under 320 mm. 


be associated with a femur of over 460 mm. 


Here h = 7-12/23-72 = 32546; & = — 10-10/15-38 = — -65670. 
Accordingly our system is 


h = :32546, k= “65670, y= — -8421. 
Our Tables give for r = — -80 
ho ; = =3 er eer 
ae 3 d/N = :009,0146 ; jes a d/N = -006,2334. 
Thus, for h = -3, k = -65670, d/N = -007,4377. 
ge ; a a d/N = -006,3352; : 7 a d/N = -004,3066. 


Thus for h = :4, k = -65670, d/N = -005,1850. Accordingly for 
h = -32546, k = -65670, 


we find d/N = -006,8642. 
We must now repeat this work for r = — -85. 
h=°:3 A=:3 : . 
pe a d/N = -004,6616 ; ie a d/N = -002,9477. 
Mhus tor h=<+3, k = -65670, d/N = -003,6898, 
h=-4 ps 
— .NN9 90K - =e 
a A aN = -002,9950; a d|N = -001,8483. 


Thus for h = -4, k = -65670, d/N = -002,3448. Accordingly for 
h = 32546, k = -65670, 


we find d/N = -003,3474. 
We have accordingly for the given values of h and k: 
y= — :80, y = — ‘85, 
d/N = -006,8642, d/N = -003,3474, 
or, for r= — -8421, d/N = -003,9006. 
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Accordingly the occurrence of individuals with femur and humerus within the 


limits given is about 0-4 per cent. of the male French population. 


Many other illustrations will occur to the biometric reader of cases wherein 
these additional tables are likely to be of service. I have to record my very hearty 
thanks to Dr W. F. Sheppard for the loan of his manuscript tables of the probability 
integral which enabled me to work to a larger number of figures than his published 


tables, which go to fewer figures, would have permitted. 
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d/N for r= — ‘80. 


h=0 h='l h=2 | h=B h="4 h=5 

k= 0 -1024,160 -0835,592  -0669,639 -0526,767 | -0406,497 | -0307,541 
+1) -0835,592 | -0673,159 | -0532,349 -0413,006 | -0314,152 | -0234,157 
-2 -0669,639 | -0532,349  -0415,199 -0317,511 | -0237,937 |-0174,638 
-3/ -0526,767 |-0413,006 |-0317,511  -0239,210 |-0176,518 |-0127,520 
-4 -0406,497 | -0314,152 | -0237,937  -0176,518 | -0128,207 |-0091,123 
‘5 -0307,541 | -0234,157 | -0174,638 — -0127,520 | -0091,123 | -0063,694 
‘6 -0227,991 |-0170,936 | -0125,482 -0090,146 | -0063,352 | -0043,534 
‘7 -0165,531 | -0122,156 | -0088,225 -0062,334 | -0043,066. | -0029,084 
‘8 -0117,648 | -0085,420 | -0060,673 | -0042,143 | -0028,615 | -0018,986 
‘9 -0081,817 | -0058,423 | -0040,797 -0027,849 | -0018,578 |-0012,107 
1-0 -0055,652 | -0039,068 | -0026,812 -0017,982 | -0011,782 | -0007,539 
1-1 -0037,010 |-0025,535 |-0017,217 | -0011,342 | -0007,297 | -0004,583 
1-2 -0024,056 | -0016,307 |-0010,799 | -0006,985 | -0004,412 | -0002,720 
1-3. -0015,278 |-0010,172 | -0006,615 | -0004,200 | -0002,604 | -0001,575 
1-4 -0009,477 | -0006,196 | -0003,956 | -0002,465 | -0001,499 | -0000,890 
1-5) -0005,741 | -0003,685 | -0002,309 -0001,412 | -0000,842 | -0000,490 
1-6 -0003,395 | -0002,139 | -0001,315 | -0000,789  -0000,462 | -0000,264 
1-7|-0001,960 | -0001,211 | -0000,731 | -0000,430 | -0000,247 |-0000,138 
1-8 -0001,104 | -0000,669 | -0000,396  -0000,229 |-0000,129 | -0000,071 
1-9} -0000,606 | -0000,361 | -0000,209 —-0000,118 |-0000,065. | -0000,035 
2-0 | -0000,3252 | -0000,1897  -0000,1079 -0000,0599 | -0000,0324 | -0000,0171 
2-1 | -0000,1700  -0000,0972  -0000,0542  -0000,0295 | -0000,0156] -0000,0081 
2-2 | -0000,0867 | -0000,0486 | -0000,0266  -0000,0142 | -0000,0074 | -0000,0037 
2-3 | -0000,0431 | -0000,0237 | -0000,0127  -0000,0066  -0000,0034| -0000,0017 
2-4 | -0000,0209 | -0000,0112 -0000,0059  -0000,0030 | -0000,0015] -0000,0007 
2-5 | -0000,0099 | -0000,0052 | -0000,0027 | -0000,0013 | -0000,0007 | -0000,0003 
2-6] -0000,0045) -0000,0023  -0000,0012) -0000,0006 | -0000,0003} -0000,0001 

h=9 | h=10 | h=l1 | A=12 h=13 h=14 
| | 

k= 0/-0081,817 |-0055,652 -0037,010 | -0024,056  -0015,278 | -0009,477 
-1|-0058,423 |-0039,068 -0025,535 |-0016,307 |-0010,172 | -0006,196 
-2/-0040,797 |-0026,812 -0017,217 |-0010,799 | -0006,615 | -0003,956- 
-3/-0027,849 |-0017,982 -0011,342 |-0006,985 |-0004,200 | -0002,465 
-4/-0018,578 |-0011,782 -0007,297  -0004,412 | -0002,604 | -0001,499 
-5|-0012,107 |-0007,539 -0004,583 | -0002,720 | -0001,575 | -0000,890 
-6|-0007,705 |-0004,710 -0002,810 -0001,637 | -0000,930 | -0000,515 
-7|-0004,788 |-0002,873 -0001,682 | -0000,961 -0000,535 | -0000,291 
-8 | -0002,904 |-0001,710 | -0000,982 -0000,550 | -0000,301 | -0000,160 
-9}-0001,719 |-0000,993 | -0000,559 |-0000,307 -0000,165 | -0000,086 
1-0 -0900,993. | -0000,562 | -0000,311 |-0000,167 | -0000,088 | -0000,045 
1:1|-0000,559 |-0000,311 -0000,168 | -0000,089 | -0000,046 | -0000,023 
1-2} -0000,307 |-0000,167 | -0000,089 | -0000,046 | -0000,023 | -0000,011 
1-3|-0000,165 |-0000,088 | -0000,046 | -0000,023 |-0000,012 | -0000,006 
1-4|-0000,086 |-0000,045 -0090,023 | -0000,011 |-0000,006 | -0000,003 
1-5|-0000,044 |-0000,023 -0000,011  -0000,005. | -0000,003. | -0000,001 
1-6|-0000,022 |-0000,011 -0000,005  -0000,003 |-0000,001 | -0000,0005 
1-7) -0000,011 | -0000,005 000,002 -0000,001 | -0000,0005} -0000,0002 
1:8} -0000,005 |-0000,002 | -0000,001 | -0000,0005. -0000,0002| -0000,0001 
1-9} -0000,002 |-0000,001  -0000,0005 -0000,0002  -0000,0001) -0000,0000 
2-0| -0000,0010 | -0000,0005  -0000,0002 -0000,0001  -0000,0000 
2-1|-0000,0004 | -0000,0002 -0000,0001 | -0000,0000 | 
2-2) -0000,0002 |-0000,0001  -0000,0000 | 
2-3 -0000,0001 | -0000,0000 | | | 
2-4 | -0000,0000 


h=6 h="7 h='8 
-0227,991 |-0165,531 |-0117,648 
-0170,936 | -0122,156 | -0085,420 
-0125,482 | -0088,225 | -0060,673 
-0090,146 | -0062,334 | -0042,143 
-0063,352 | -0043,066 | -0028,615 
-0043,534 |-0029,084 | -0018,986 
-0029,242 |-0019,193 | -0012,306 
-0019,193 |-0012,373 | -0007,790 
-0012,306 | -0007,790 | -0004,814 
-0007,705 |-0004,788 | -0002,904 
-0004,710 | -0002,873 | -0001,710 
-0002,810 | -0001,682 | -0000,982 
-0001,637 | -0000,961 | -0000,550 
-0000,930 | -0000,535 | -0000,301 
-0000,515 |-0000,291 | -0000,160 
-0000,278 |-0000,154 | -0000,083 
-0000,147 |-0000,080 | -0000,042 
-0000,075 |-0000,040 | -0000,021 
-0000,038 |-0000,020 | -0000,010 
-0000,018 |-0000,009 | -0000,005 
-0000,0088 | -0000,0044 | -0000,0021 
-(000,0041 | -0000,0020 | -0000,0010 
-(000,0018 | -0000,0009 | -0000,0004 
-0000,0008 | -0000,0004 | -0000,0002 
-(000,0003 | -0000,0002 | -0000,0001 
-(000,0001 | -0000,0001 | -0000,0000 
-(000,0001 | -0000,0000 
h=V'5 h=16 h=l7 
-0005,741  0003,395 | -0001,960 
-0003,685 -0002,139 |-0001,211 
-0002,309 -0001,315 | -0000,731 
-0001,412 | -0000,789 | -0000,430 
-0000,842 | -0000,462 | -0000,247 
-0000,490 , -0000,264 | -0000,138 
-0000,278 | -0000,147 | -0000,075 
-0000,154 |-0000,080_ | -0000,040 
-0000,083 | -0000,042. | -6000,021 
-0000,044 | -0000,022 | -0000,011 
-0000,023  -0000,011 | -0000,005 
-0000,011 -0000,005. | -0000,002 
-0000,005 -0000,003. | -0000,001 
-0000,003 | -0000,001 | -0000,0005 
-0000,001  -0000,0005 | -0000,0002 
-0000,0005  -0000,0002 | -0000,0001 
-(000,0002  -0000,0001 | -0000,0000 
-0000,0001  -0000,0000 
-0000,0000. 

| 
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dj/N for r= — ‘80. 
. , 
h=18 p=19 | h=2-0 h=21 | h=22 h=23 | h=2 h=25 = 26 
| —sess || ps Se | eS — = = 
k= 0/-0001,104 | -0000,606 |-0000,3252  -0000,1700. -0000,0867 | -0000, 0.0431, .0000, 0209 9 .0000,0099) -0000,0045 
-1|-0000,669 | -0000,361 | -0000,1897 -0000,0972_ -0000,0486  -0000,0237, -0000,0112 | -0000,0052_ -0000,0023 
- .2|-0000.396 | -0000.209 | -0000.1079 | -0000,0542. -0000,0266 | -0000,0127 | -0000,0059 | -0000,0027 +0000,0012 
-3|-0000.229 | -0000.118 | -0000,0599 | -0000,0295  -0000,0142  -0000,0066  -0000,0030 | -0000,0013 -0000,0006 
-4|-0000,129 | -0000,065  -0000,0324 -0000,0156, -0000,0074_ -0000,0034 -0000,0015 | -0000,0007  -0000,0003 
-5|-0000,071 | 000,035. -0000,0171 , -0000,0081 | -0000,0037  -0000,0017  -0000,0007 | -0000,0003' -0000,0001 
-6| -0000,038 | -0000,018  -0000,0088  -0000,0041  -0000,0018  -0000,0008  -0000,0003 | -0000,0001  -0000,0001 
7|-0000,020 000,009 | -0000,0044 -0000,0020  -0000,0009 -0000,0004, -0000,0002 | -0000,0001 -0000,0000 
-8|-0000,010 | -0000,005 -0000,0021  -0000,0010, -0000,0004 | -0000,0002. -0000,0001 | -0000,0000 
-9|-0000,005 | -0000,002  -0000,0010. -0000,0004. -0000,0002. -0000,0001  -0000,0000 | 
1-0] -0000,002 | -0000,001 00,0005, -0000,0002, -0000,0001 -0000,0000 
1-1|-0000,001 | -0000,0005 | -0000,0002. -0000,0001 -0000,0000 
1-2| -0000,0005 | -0000,0002 -0000,0001  -0000,0000 
1-3 | -0000,0002 | -0000,0001  -0000, ,0000. 
1-4 | -0000,0001 | -0000,0000. | 
1-5|-0000,0000 | | 
d/N for r= —°85 
| h=0 ae h=:2 h='3 h='4 h='5 | h='6 h=7 h='8 
| | | 
k= 0 -0883,010 |-0696,651 |.0537,127 -0404,314 |-0296,851 |-0212,402 |-0147,991 -0100,334 | -0066,146 
1 -0696,651 | -0539,919 | -0408,547 | -0301,542 |-0216,905 -0151,935 -0103,559 068,639 -0044.211_ 
-2 -0537,127 |-0408,547 | -0303,123 |-0219,192 |-0154.352 |-0105,767 |-0070,478 |-0045,639 | -0028,705 | 
-3-0404,314 |-0301,.542  -0219,192 |-0155,166 |-0106.889 | -0071.604 |-0046,616 -00291477  -0018.094 
4 -0296,851 /-0216,905 | -0154,352 | -0106,889 | -0071,984 | -0047,113  |-0029,950 -0018,483 | -0011,068 
-5 -0212,402 |-0151,935 -0105,767 | -0071,604 |-0047.113 | -0030,109 |-0018,680 -0011.246  -0006.566 
-6 -0147,991 |-0103,559 -0070,478 |-0046,616 |-0029.950 |-0018,680 |-0011.306 -0006,637 | -0003.777 
‘7 -0100,334 |-0068,639 -0045,639 |-0029,477 |-0018.483  -0011.246 | -0006.637 -0003.797  -0002.106 
8 -0066,146 | -0044,211 | -0028,705 |-0018,094 |-0011,068 | -0006,566 |-0003,777 -0002,106 -0001.137 
-9 |-0042,377 |-0027,658 -0017.526 |-0010,778 }-0006.428 -0003.717 | -0002.083  -0001.131  -0000.595 
1-0 -0026,368 -0016,797 | -0010,383. |-0006,226 |-0003,620 | -0002.039 |-0001,113 -0000,589  -0000,301 
1-1 /-0015,927 -0009,897 005,966. |-0003,487 |-0001,975  -0001,084 -0000,576  -0000,297 | -0000,148 
1-2 -0009,335 -0005,656 | -0003,323 |-0001,893 |-0001,044 | -0000,558 |-0000,289 -0000,145  -0000.070 
1-3, -0005,306 | -0003,134 | -0001,794 | -0000,995 |-0000,535 0000278 |-0000,140 000,068 -0000,032 
1-4 |-0002,924 | -0001,682  -0000,938 | -0000,507 |-0000,265  -0000,134  -0000,066. 0000031 -0000,014 
1-5 -0001,561 |-0000,875 | -0000,475 | -0000,250 |-0000,127  -0000,063 -0000,030 -0000,014 -0000,006 
1-6 -0000,808 | -0000,441  -0000,233 | -0000,119 |-0000,059  0000,028 -0000,013  -0000,006  -0000,003 
1-7 -0000,405  -0000,215  -0000,111 | -0000,055 |-0000,027  -0000,012  -0000,006  -0000,002  -0000,001 
1-8 -0000,196 -0000,102 -0000,051 |-0000,025 |-0000,012 | -0000,005 -0000,002 -0000,001 0000/0004. 
1-9 -0000,092 | -0000,046  -0000,023 | -0000,011 |-0000,005 -0000,002 -0000,001  -0000,0004 -0000,0001 
2-0 |-0000,0419 -0000,0205  -0000,0097 | -0000,0045 | -0000,0020. -0000,0009. -0000,0004. -0000,0001 -0000,0001. 
2-1 -0000,0184  -0000,0088  -0000,0040 | -0000,0018 | -0000,0008 -0000,0003 -0000,0001. -0000,0000. -0000,0000 
2-2 -0000,0079 -0000,0036  -0000,0017 | -0000,0007 | -0000,0003 -0000,0001  -0000,0000 | 
2-3, -0000,0032 -0000,0015, -0000,0006 | -0000,0003 | 0000,0001 -0000,0000 
2-4 -0000,0013 -0000,0006 -0000,0002. -0000,0001 | .0000,0000, | 
2:5. -0000,0005 -0000,0002 -0000,0001 | 0000, 0000 | | 
2:6. -0000,0002 -0000,0001, -0000,0000 | | 
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a/N for r= — ‘85. 
| 
| h='9 | h=10 T= Velo aie ilee h=1:3 h=14 h=15 h=1%6 h=17 
|k= 0|-0042,377 |-0026,368 |-0015,927 | -0009,335 | -0005,306 | -0002,924 | -0001,561 |-0000,808 | -0000,405 
| +1 -0027,658 | -0016,797 | -0009,897 | -0005,656 | -0003,134 | -0001,682 | -0000,875 | -0000,441  -0000,215 
-2/-0017,526 | -0010,383. |-0005,966 | -0003,323 |-0001,794 | -0000,938 |-0000,475 | -0000,233  -0000,111 
*3|-0010,778 | -0006,226 | -0003,487 | -0001,893 | -0000,995 | -0000,507 | -0000,250 -0000,119 | -0000,055 
-4|-0006,428 | -0003,620 | -0001,975 | -0001,044 | -0000,535 | -0000,265 | -0000,127 | -0000,059 | -0000,027 
‘5 |-0003,717 | -0002,039 | -0001,084 | -0000,558 -0000,278 -0000,134 |-0000,063  -0000,028 — -0000,012 
‘6 | -0002,083  -0001,113 | -0000,576 | -0000,289  -0000,140 | -0000,066 | -0000,030 | -0000,013 | -0000,006 
-7|-0001,131  -0000,589 | -0000,297 | -0000,145 | -0000,068 | -0000,031 |-0000,014 | -0000,006 | -0000,002 
— +8 -0000,595  -0000,301 | -0000,148 -0000,070 | -0000,032 | -0000,014 |-0000,006 | -0000,003 | -0000,001 
9 -0000,303 | -0000,149 | -0000,071 | -0000,033 | -0000,015 | -0000,006. |-0000,003 -0000,001 | -0000,0004 
 1-0/-0000,149 | -0000,072 | -0000,033 | -0000,015 | -0000,006 — -0000,003 |-0000,001 , -0000,0004  -0000,0002 
— 1-1) -0000,071  -0000,033 | -0000,015 | -0000,007 | -0000,003 | -0000,001 | -0000,0004. -0000,0002  -0000,0001 
| 1-2) -0000,033 | -0000,015 | -0000,007 | -0000,003 | -0000,001  -0000,0005 | -0000,0002  -0000,0001  -0000,0000 
| 1:3]-0000,015 | -0000,006 | -0000,003 |-0000,001 | -0000,0005  -0000,0002 -0000,0001 -0000,0000 
| 1-4 -0000,006 | -0000,003 | -0000,001 | -0000,0005 | -0000,0002  -0000,0001 | -0000,0000_ | 
| 1-5) -0000,003 | -0000,001 | -0000,0004 -0000,0002 | -0000,0001  -0000,0000 | | 
1-6 | -0000,001  -0000,0004 | .0000,0002  -0000,0001  -0000,0000 | | | 
1-7 | -0000,0004 | -0000,0002 | -0000,0001 | -0000,0000 | | 
1-8 | -0000,0002  -0000,0001 | -0000,0000 | | 
1-9 | -0000,0001  -0000,0000 | 
2-0, -0000,0000, | 
h=1'8 h=19 h=2:0 h=2°71 h=2'2 h=2°3 h=2°4 h=2'5 h=2'6 
(k= 0) -0000,196 | -0000,092  -0000,0419 | -0000,0184 -0000,0079  -0000,0032 | -0000,0013  -0000,0005 | -0000,0002 
-1/-0000,102 -0000,046 — -0000,0205  -0000,0088 -0000,0036 -0000,0015 | -0000,0006 | -0000,0002 | -0000,0001 
| +2 -0000,051  -0000,023  -0000,0097 -0000,0040 -0000,0017  -0000,0006 | -0000,0002 | -0000,0001 | -0000,0000 
| +3. -0000,025  -0000,011  -0000,0045  -0000,0018 -0000,0007 -0000,0003  -0000,0001 | -0000,0000. 
-4/-0000,012  -0000,005  -0000,0020 | -0000,0008 | -0000,0003 | -0000,0001 | -0000,0000 
| +5 |0000,005 — -0000,002  -0000,0009 | -0000,0003  -0000,0001 -0000,0000 | 
‘6 |-0000,002 -0000,001 | -0000,0004 | -0000,0001 | -0000,0000. 
“7 |-0000,001  -0000,0004_ -0000,0001 | -0000,0000 | 
-8 |-0000,0004 -0000,0001 -0000,0001 | | 
‘9 |-0000,0002 -0000,0001  -0000,0000 | 
1-0 | -0000,0001 -0000,0000 | 
1-1 | -0000,0000 | | 
| 
a/N for r= — 90. 
R=). eh lee eee 2 sey) isa p= 5 h=6 hal h='8 
k= 0 -0717,832 | -0535,024 -0385,698 | -0268,454 | -0180,108 | -0116,302 | -0072,185 |-0043,011 | -0024,576 
| +1 -0535,024 | -0387,680  -0271,230 | -0182,917 | -0118,733 | -0074,080 -0044,371 | -0025,486 | -0014,024 
| +2 -0385,698 | -0271,230 | -0183,863 | -0119,967 | -0075,240 | -0045,302 -0026,157 | -0014,469 | -0007,661 
+3 -0268,454 | -0182,917  -0119,967 -0075,631 | -0045,775 | -0026,569 -0014,774 | -0007,863 | -0004,003 
-4/-0180,108 | -0118,733 -0075,240 | -0045,775 | -0026,707 | -0014,928 -0007,987 | -0004,087 | -0001,999 
‘5 -0116,302 | -0074,080  -0045,302 | -0026,569 | -0014,928 | -0008,029  -0004,130 | -0002,030 | -0000,953 
‘6 -0072,185 | -0044,371 -0026,157 | -0014,774 -0007,987  -0004,130  -0002,041 | -0000,963  -0000,434 
‘7 -0043,011 | -0025,486 -0014,469 | -0007,863  -0004,087 | -0002,030 | -0000,963 | -0000,436 | -0000,189 
-8 -0024,576 | -0014,024 | -0007,661 | -0004,003 | -0001,999  -0000,953 000,434 | -0000,189 | 000,078 
‘9 -0013,452 |-0007,386 | -0003,879 | -0001,947  -0000,934 | -0060,427  -0000,187 | -0000,078 | -0000,031 
1-0 -0007,048 |-0003,721 -0001,878 | -0000,905 | -0000,416 | -0000,183 | -9000,077 | -0000,031 | -0000,012 
1-1 -0003,532 |-0001,791  -0000,868 | -0000,401 | -0000,177 | -0000,075  -0000,030 | -0000,011 | -0000,004 
1-2 -0001,692 | -0000,824 -0000,383 |-0000,170 | -0000,072 | -0000,029 | -0000,011 | -0000,004 | -0000,001 
1-3. -0000,774 | -0000,362 -0000,161 |-0000,069 | -0000,028 | -0000,011 | -0000,004 | -0000,001 | -0000,0005 
1-4 -0000,338 | -0000,151 -0000,065 | -0000,026 — -0000,010 | -0000,004 | -0000,001 | -0000,0005 | -0000,0001 
1-5 -0000,141 | -0000,060  -0000,025 |-0000,010 | -0000,004 | -0000,001 | -0000,0004 | -0000,0001 | -0000,0000 
| 16 -0000,056 | -0000,023 — -0000,009 | -0000,003 | -0000,001 | -0000,0004  -0000,0001 | -0000,0000 
1-7 -0000,021 | -0000,008 — -0000,003 | -0000,001 | -0000,0004 | -0000,0001 | -0000,0000 
| 1-8 -0000,008 | -0000,003 -0000,001 | -0000,0004 | -0000,0001 | -0000,0000 
1-9 -0000,003 | -0000,001 | -0000,0003 | -0000,0001  -0000,0000 | 
2-0 -0000,0009 | -0000,0003  -0000,0001 | -0000,0000 | | 
| 2-1 -0000,0003} -0000,0001  -0000,0000 | | 
2-2 -0000,0001 | -0000,0000 | | 
2-3, -0000,0000 | | 
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d/N for r= —-90. 
| | | 
| A= |) ypeslah hove” |) ha 13 h=l4 | hal h=16 (ier 
k= 0 -0013,452 |-0007,048  -0003,532 | -0001,692 |-0000,774  -0000,338 -0000,141 | -0000,056 | -0000,021 | 
‘1 -0007,386 | -0003,721 -0001,791 | -0000,824 | -0000,362 | -0000,151 | -0000,060_ | -0000,023 | -0000,008 
-2 -0003,879 |-0001,878 | -0000,868 | -0000,383 | -0000,161 |-0000,065 -0000,025 | -0000,009 | .0000,003 
-3 -0001-947 | -0000,905  -0000,401 | -0000,170 | -0000,069 -0000,026  -0000,010  -0000,003 | -0000,001 
-4 -0000,934 |-0000,416  -0000,177 |-0000,072 | -0000,028 -0000,010 | -0000,004. | -0000,001 | -0000,0004 
‘5 -0000,427 -0000,183 -0000,075 | -0000,029 -0000,011  -0000,004 -0000,001  -0000,0004  -0000,0001 
“6 -0000,187 | -0000,077 | -0000,030 |-0000,011  -0000,004  -0000,001 | -0000,0004 | -0000,0001 | -0000,0000 
‘7 -0000,078 | -0000,031 | -0000,011 | -0000,004 | -0000,001  -0000,0005  -0000,0001 | -0000,0000 | 
8 -0000,031 | -0000,012  -0000,004 | -0000,001_ -0000,0005  -0000,0001 | -0000,0000 
‘9 -0000,012 |-0000,004 | -0000,001 | -0000,0005. -0000,0002. -0000,0000 
1-0 -0000,004 | -0000,001  -0000,0005 | -0000,0002. -0000,0000 
1-1 -0000,001 | -0000,0005  -0000,0002 | -0000,0000, | 
1-2. -0000,0005) -0000,0002  -0000,0000 | | 
1-3, -0000,0002| -0000,0000. | 
1-4 -0000,0000 | 
| 
| | | 
h=18 h=19 | h=2°0 ie eee ee h=2:3 
k= 0/-0000,008  -0000,003 -0000,0009 | -0000,0003 | -0000,0001, -0000,0000, 
-1|-0000,003 | -0000,001 | -0000,0003 | -0000,0001 | -0000,0000 
-2|-0000,001 | -0000,0003 | -0000,0001 | -0000,0000 | 
-3 | -0000,0004. -0000,0001 | -0000,0000 | 
-4 -0000,0001 -0000,0000 | | 
‘5 -0000,0000 | | | | 
d/N for r= — ‘95. 
h=0 he hi as} h='4 h='5 h='6 l= 27 h='8 
k= 0 -0505,416 |-0330,242 | -0202,154 |-0115,376  -0061,136 | -0029,965 |-0013,543  -0005,629 | -0002,146 
‘1 -0330,242 | -0203,181 |-0116,555 | -0062,076  -0030,582 |-0013,893 | -0005,804 | -0002,224 | -0000,781 
-2 -0202,154 |-0116,555 | -0062,393 | -0030,895  -0014,107 | -0005,923 | -0002,282 | -0000,805  -0000,260 
‘3. -0115,376 | -0062,076 | -0030,895 | -0014,179 — -0005,984 | -0002,317 | -0000,822 | -0000,266 | -0000,079 
-4 -0061,136 | -0030,582 |-0014,107 | -0005,984  -0002,329 | -0000,830  -0000,270 | -0000,080 | -0000,022 
‘5 -0029,965 | -0013,893 -0005,923 | -0002,317  -0000,830. | -0000,272  -0000,081 | -0000,022 | -0000,005 
‘6 -0013,543 | -0005,804 | -0002,282 | -0000,822 — -0000,270 | -0000,081 | -0000,022 | -0000,006 | -0000,001 
‘7 -0005,629 | -0002,224 | -0000,805 | -0000,266 -0000,080. |-0000,022  -0000,006 | -0000,601 | -0000,0003 
‘8 -0002,146 | -0000,781 | -0000,260  -0000,079 — -0000,022 | -0000,005  -0000,001  -0000,0003 -0000,0000 
‘9 -0000,749 | -0000,250 |-0000,076 | -0000,021 — -0000,005. | -0000,001  -0000,0003 -0000,0000 
1-0 -0000,239 | -0000,073 | -0000,021 -0000,005  -0000,001 | -0000,0003. -0000,0000 | 
1-1 -0000,070 | -0000,020 | -0000,005 | -0000,001 , -0000,0002 | -0000,0000. | 
1:2 -0000,019 | -0000,005 | -0000,001 | -0000,0002. -0000,0000 | | 
1:3 -0000,004 | -0000,001  -0000,0002, -0000,0000 | | 
1-4 -0000,001 | -0000,0002 | -0000,0000 | | | 
1-5. -0000,0002  -0000,0000 | | | 
1-6 -0000,0000 | | | | 
ae ee a 
h=9 NAD) h=11 h=1-2 h=1: h=14 h=1:5 
| | 
“k= 0 -0000,749  -0000,239 | -0000,070 | -0000,019 | -0000,004  -0000,001 | -0000,0002 | 
‘1 -0000,250 | -0000,073 | -0000,020 | -0000,005  -0000,001  -0000,0002 | -0000,0000 
‘2 -0000,076  -0000,021 | -0000,005 | -0000,001  -0000,0002. -0000,0000 | 
-3/-0000,021  -0000,005 | -0000,001  -0000,0002. -0000,0000. 
‘4 -0000,005 -0000,001 | -0000,0002 | -0000,0000 | 
‘5 -0000,001  -0000,0003 | -0000,0000 | | 
-6 | -0000,0003 -0000,0000 | | 
“7 | | 


ON THE PROBABLE ERROR OF BISERIAL 2. 
By KARL PEARSON, F.R.S. 


(1) In a paper in Biometrika, Vol. vit, pp. 248-257, entitled “On a New 
Method of determining Correlation where one Variable is given by alternative and 
the other by multiple Categories,” I gave in 1910 the process of determining what 
is now generally termed “biserial 7.” 


Let z be the alternative, y the multiple variate, %, the distance from the division 
between the alternative categories of the mean of the array of x’s corresponding to 
a given value of y, ,o, its standard deviation and n, its frequency. Let #, o, and 
N be the corresponding quantities for the marginal total. Then, if 


(i £/cz, Vea BSc 


1 dy \2)  /B\273 
No \n, & = e 


7= aera weet REE eae (1), 
1 aD, Ve 
Ea oll Ree, 
1+ w' \n, ) } 


and explained that 7 might be found from the usual tables of the probability 
integral. I did not publish the probable error of this method of determining 
“biserial 7” at the time, and it has remained one of the few outstanding cases 
where probable errors were still wanting. I now add the determination of the 
probable error, and owing to the kindness of my colleagues, Miss Ethel M. Elderton 
and Miss B. M. Cave, am able to give a table for its fairly easy computation. 


I showed that 


(2) I take as my biserial table the following: 


ey Noy | Rig 2 i \odecee Vy 
Np Noo | Mosh Ni eceee Vy 
— ———| — -- —|—— — |-—— — — 
| 
. 
ny Ne | oe CORON meres N 


Here n, stands for n,, and the y variate is the horizontal or multiple category 


variate. We shall write 


ioe 7S (iy), y= Aaa.) eee eee (ii), 


guns 2 
BrShiCEea Vom qi eee Arial), 
and thus n ae 1 Eta 


wie ie ate lard 
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Accordingly to a first approximation 


Se yoy 
dn = jee eee 


where 7, « and y are now mean values and dn, d« and dy are variations from those 
means. Squaring, summing for all values, and dividing by the number of cases: 
1 =) 2 ye (1 52) r 
pope ban y) Demos Luv pd AOS a he 
Heo, (1+ 2)? 20,2 + (+122 Oy (1+ 23 Mean (d«dy) ...(i11). 
Hence to find o, we require to evaluate o,, o, and Mean (dydx). We shall 
now proceed to the consideration of these quantities. 


] ees ela 
We have N =——| e 2" dz. 
rl V2 - Y 
1 -i7? 
Hence OV = — oy ——e =? 
if Y on 
= — dy?, 


where z is the ordinate of the frequency curve of the marginal total at the 
boundary of the alternative categories. 


Bye tens 
Thus og nie = 0,2". 


But if the sample has been taken from an indefinitely large population 


Accordingly — ae ON ty ere Oe eee eee (iv). 
Again KOK = S (KOK), 
where Ke = Rye 
Ko? = S («,20%,.) + 2S {ies Mean (x, 0Ky)} 00. etait (v). 


We require therefore to find o?, and Mean (3«,6«,’). Now 


¢ 8s ¢ 
2K,0K, = Ws +2 N ROW A macrecnoneecetine ad tesco (vi) 
Ly i 8 
But Ny, = nN, —— | e a dx, 
V2 Ys 
dn N4,0n 
a vii 
i ne ses woven (van) 
And accordingly : 
Orne 245 m1 2 22 
3 — —¥ Mean (6n,,5n,) + —7 on, = %°o",,. 
Ns Ns Ng 
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Ny s N., 
20 (2 si Is Py Nae 8 
But Oo ie ae N45 (1 — *) 5 (on ith Ns €@ = 7") 4 


Mean (6n,,5”,) = Mean (6n,,6n,,) + Mean (dn,,)? 


Ny.Nos N45 
— Na a as (1 pas! 7) 


NasNos se 
Thus i (viii). 


It will be seen that (vin) is of the same form as (iv), although the value of n, 
is variable, while that of NV is constant. 


Now multiply (vu) by dn, and sum and we have 


N4y5 Ns Ms a Ns ss 
Ns (1 x) ne Ms ( 7") = — z, Mean (dy,6n,). 


Thus Mean (63612) =10". 76.20... 0. eee ee (ix). 


Returning to (vi), squaring and summing, we find 


le it Ns MMos 
An2g?, = — ly 4n. (1 = *) 4 dy 2a) 2 —1s"2s 
so ks NZ jee s N ae neze |’ 


ce 1 N\., MsNos [Ys\" 
or, KPO", = HB {iveing (1 "i + ai (”*) SE on icsooe (x). 


s 


Again returning to (vi), multiplying it by the corresponding values of 2Kyd«y 
and summing, we reach 


tow 
MeN sos NNs! Deis 
deny Mean (6«,dK,) = — Ve yerye? + 4 “ye Ys¥s! Mean (dy, dy,) 
2s yy? Mean (8nySy,) + 2 +5 yvy2 Mean (5n,8 
+ 2 yn Ve¥s! ean (dn, dy,) + 2 n2Ys'7s, Mean (On, dyy). 
But 
NsamSy. oH y= We MysMy — Nys My Ns Ny } Ms MsNy | Ms Ny Ms\ _ 0 
aed Zee\ Naty N “nen, N nn N nny N 


Further multiplying (vu) by én, we find 


Mean (5n,Sy,) = ! ( Matta a) = 0, also. 


Zs n,N ' n2N 
; Lin gny ee e A 
Accordingly k,ky Mean (6«,d«y) = — [ONS YS Ye cee (xi). 


Returning to (v) we can now evaluate it by aid of (x) and (xi); thus we have 


PrN ixe ines Ms MsNas Ye) — 18 (n nev2y.2)' (xii 
Ro. — wa {tS ite Ns (1 — x) ar S| n, ie oS (nN; Ys Ys ) (xil). 
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It now remains to find Mean (d«dy). We have: 
« Mean (d«dy) = S (x, {Mean (d«,dy)}) 


Bais nt + § {tate 
=i8 Mean (dy6n,)- + S N Mean (dy,dy)>. 


N 
We must now find Mean (dyén,) and Mean (dy, dy). 
Mean (6y,67,) 
Mean (y6n,) = — Nz tees 


ee colt “a 
=~ Hz ("- Nee 


Mean (3y,dy) = Mean ee Ge = nh 


NE ieee © aio. 


oe Vy 45 v1N;\) 
= Wena,’ (! x) me (Me T"')} 


Thus 
« Mean (dxdy) = — ss {3s (™457s") — = S (ny?) — 8 (Tate y.)| (xiii). 


Ne, 


We can now substitute in (iii) from (iv), (xu) and (xiii) and obtain o,?. We 


have 


; 1—~7?)4/ 1 nN, Ny Nos (Vs\2 ; es 
ion oa 4 ai a fe {hs (vin, (1 *)) 7 8 ( = ; (4) ) aS (menor) | 


Sf (LS 1P)? VyVe (2 7 
(1 ab y?)? N? \z 
(1 — aele 1 WA itary? 2 Vig 2 9 (“ute “| 
a + Pp N22 S (14s)s ) = N S (NsVs ) — 28 = ne z, i 
i Pe f(s) ( mye} 
N(1l+y)?2 | N? 5 igor ed ¥) S( N ) 


22g {Mastas Ms ve)" 
ee ea See N le 


\é5 


+ 99 (2) {5 (4) — hs (ere) 29 (he (HT 


siniees (xiv). 


Now we know all the quantities in the sums on the right of (xiv), because 
y; has had to be found for each array, and z, can be at once determined from 
%,/yo, Which is known. But the labour would be very considerable and hardly 
commensurate with the result desired, i.e. an approximate measure of the accuracy 
of determining the correlation by the biserial 7 method. We shall accordingly 
investigate the values of the above terms on the hypothesis of a large number of 
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arrays with linear regression and homoscedasticity, or even with a complete 
Gaussian distribution of frequency. 
= M N L 


R 


In the first place let the division between the two series be the line LL. Let 
C be the centroid of the frequency surface and RR the regression line. Let #— CN 
and %,— PM. Then by the equation to the regression line (correlation += n) 
POQic, = 100 /c,, 


(% — 2,)/o, = ry/oy, 


Hence 
1 9 , iD) 
ye = Se (y? — 2ry’y + r?y’?), 


1 


Ve pays SH Ory rey ets 


Thus, since we may suppose S (n,y’) = S (n,y’*) = 0 approximately : 


1 
S (n,y")/N = ae (y? + 77), 


S (nsys4)/N = Vg Oley Tntsal)> 


i. ( 
(1 — 7?)? 
where f,' is the second statistical B-constant for the y’-array, and since to our 
approximation 9 =r: 


1— pf S(nyA)  (S(ny2)? ieee ; 
‘ im , | Se | re MY =m ty® + a7? (By — D}...-..(xvi). 


2 
pig ee Cy ) mare! : r) 2 a (7 =2hyy re ry)! 


N 2 
os 1 yy or S (7454') Ho, S (2454) 
= (Saye 5 aN Nea 


But as in the theory of biserial 7* 


es ae He “1) 
N ND 


* Biometrika, Vol. vit. p. 97. 


Karu PEARSON 297 


’ My sy \ cat 4 Vy 
and thus S (Sat ) = 1% 5 
, S (my?) _ 1 « { ce 
Again Lae ON Sie Cl ye oo} > 
where the summation S’ is from «= Z to © ; thus 
S (Ny y') Vy (io ii = 22 
. 1l—?r)+ 7? == 9 EO ahe 
N Meee Wh aan 
ak 


Accordingly we have 


NasVse 1 v ae 2 
s (“ur ) ei ae a= 20h ye ane) nivel 


and accordingly 


S (4. S uus . 9.2 ‘ 2 2 
a= 9)%4 Sone Ss (7 PEER ee - antye + oP - 7 - nf) + nye} 


ay (x? aoe WT) Berge nssn ue emneis (xvil). 


Now substitute (xvi) and (xvii) in (xiv) and we find 


(aie ee ae 2 § 1) 
25.2 d : 12 ee froth PPA fre Ph Oy at 
Oy Naa | We bP A+ bn?) tay? (4 N 
(NysNos [Ys\? Ns Y o {laste Ys *t 
a2 \ 20 Wie S = s 9) 2 S 1- = 2 2 
tae) 8 | NsNs (2) at ll 2 | nn, 2, NV’ 
Or, 
(1 — »?)? ( Vg — Vy IL /y\? (vv og [7 ue 
oe ae yap 2 Zi 2 1 | 22, is 1s 1°28 | 
mmmveiem 2 2) 2 )4 ale [Ne Siar | 
1 NigNos / ’ 2 53 
I Ms 1°28 2 Vs Da } (xv 
| ae fey ((1 n ies | I Rar eee an en | Xvill). 


This form for the value of o,? shows that the probable error of 7 does not become 


Me Vv. , 
1s __ = for all values of s, 
aN 
and 7 = oe so that the last two terms in 1/7? take indeterminate forms which need 
s 


infinite when 7 = 0; for in that case, we shall have 


evaluating. For calculation it is best to use the form 


(1 — 77)? Vo — Vv 
coi NIE ple woe G ar a) 


1 (rr @) NysNos [Ys\2 0 NieNos (V\ Ms 
1 2 29 HISKAC 2S Ye) ) 9 2 Y ; 4) ( 1s ''2s (¥ :) 
a ~ es el Corti) ( NsNg & N Coe Z : NN \Z Ne ; 


“s/ 


b 


batted KIRA) 
Biometrika x1 20 
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The functions 


= 
s 


ayes Nys-Nos Vs _ 
s 


Ns. Ns 


() =20-09 (2). 


Pale 


Neo Ne 32s 


2 
a"). 


where x, $(1-+ a) and z are the values given in the tables of the probability 
function, have been calculated by my colleagues Miss Ethel M. Elderton and 
Miss B. M. Cave in the table given on p. 302 below. The labour was considerable 
as owing to the smallness of z for high values of a, the quantities involved 


had to be calculated to many places of decimals. 


put (xix) i the form: 


oan eee [ne ae y? ( 


: 2\¢ ’ Ns ¢ 9 
2 (2) + 1 — a8 (f" F) 2-2 8 


TRS 


Voies. “1) 
N 


If this table be used we can 


I propose to illustrate this on the following example correlating nature of 
vaccination and severity of attack in the case of smallpox. 


Severity of Attack. 


Nature of (1) (2) (3) : (4) (5) Total 
Vaccination Haemorrhagic} Confluent Abundant Sparse Very sparse hie 
0 to 25 yrs. since 5 38 120 176 148 487 
Over 25 yrs. or 44 251 388 308 211 1202 
not at all 
Totals 49 289 508 484 359 1689 
Positions of #/o, = V4 Poo = Yo 3/02 = Ys Uq/O4 = Ya &3/05 = Ys t/O_ = 
means = + 12700] = + 11194] =+-7185 | = +-3488 | =- = +5583 
Ordinates at z= °17810 | z, = -21321 | 2, = -30818 | 2, = -37540 | 2, = 38925 z = 34137 
dichotomic line 
Tel We -10204 -13149 -23622 36364 -41226 = = :28834 
Ng¢/Ng -89796 -86851 -76378 63636 58774 = = -71166 
n,/N “02901 “17111 30077 28656 -21255 Sum = 1-00000 
N4./N -00296 “02250 ‘07105 10420 ‘08763 Sum = -28834=v,/N 
Ver 1-61290 1-25306 -51624 12166 04920 y? = -31170 
¥3/2s 7:13083 5-25022 2-33143 92914 56981 y/z = 1:63547 
(ys/%s)? 50-84867 27-56489 543556 86331 -32469 | (y/z)? = 2-67476 
ngy,2/N 04679 21441 15527 03486 01046 = -46179 
nsyst/N -07547 -26867 ‘06812 00424 ‘00051 a OOMG 
Msys°/N ‘00477 02819 -03668 01268 -00431 Sum = -08663 
a 018955 102594 126515 061613 -029346 | Sum = -339023 
we Wu (%) 135162 588639 -294961 057247 016722 | Sum = 1-042731 
g Ns 8 
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Now n? = (x2 — y*)/((1 + «?) = -102,6755 and 7 = 3204. 
Again, S (n,y.4) N = -41701 actually; but if it be calculated from the value of 
Dae206; 1.2. 
I 9 / 
S (nye) N = Ga (Ye Onzy? 4° Be.) 


taking B,’ = 3, its value = -39843, which is approximate enough for most practical 


purposes, 
(ES me) s vies) ce Su 
4 ( N oN 


taken actually, = +201,298 {-41701 — (-46179)?} = -041,0165, 


while the approximate value in (xvi) gives :037,2751. 


In the next place 


ay Y Ny sVer Vy g (Ms SN 
(1 1) 1s ( N ) ASG: 


taken actually, = — -068,270, while the approximate value in (xvi) gives — -O1517. 
Further*, 
1— 2) ; 2g (Mista Se Ye) : 9 x 1-63547 x -339.0234 
(1 =?) Bs S WN. 1-794,649 x 1-63547 x -339,023 


I 


-995,062, 


and will be seen to be of wholly different order to the terms already computed. 


| 


: OM VD eva is 
Again, We (“) = -28834 x -71166 x 2:67476 


= -548,861. 
Lastly we have the term 


r 2 
(1 — 2)2.8 ee fe (2) = -805,191 x 1-042,731 4 


nen, N \z, 
= -839,598. 
Thus substituting in (xiv) we have 


(Ciena or ene: re 
a £55 . 1165 + -839,598 — -068,270 — -995,062 
On = Fea yap CE8,861 + -041,0165 + -839,598 — -068,270 — -995,062} 
*805,191 « -366,144 he s 
IER Alone mae(ESINIR Te 
“. , = 03143, 
and Probable Error = -67449o, = -0212. 


* In this case y, is of the same sign throughout, but caution must be taken to see from the data 
whether for certain arrays it changes sign. 

+ Found from the table of y’, this is -338,961. 

{ Found from the table of ¥’, this is 1-044,911. The differences in this and the previous value 
for y’ depend on cutting off at 5 or 6 decimal places and in using in the different processes only first 
differences. 

20—2 
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For comparison we will investigate the value as given by the less exact formula 
(1x): 


jn TTT ae 051,338 + -31170 (-102,676 + -423,327) 
+ 9-739,422 (-548,861 + -839,598 — -995,062)] 
— -000,277,077 x 4-046,752 = -001,121,26. 
Hence o, = °03349, 
and Probable Error = -0226. 


Thus for all practical purposes (xix) gives as good a result as the more lengthy 
formula (xiv). Had the value r = -3204 been computed by the product moment 
method for a population of 1689 the probable error would have been -0147. Thus 
the accuracies of the product-moment and the biserial 7 methods appear to be 
about roughly in the ratio of 2 to 3 for this case. 


We will now turn back to the formula (xvii) and endeavour to express the 
quantities in curled brackets in powers of r or 7. We have 


iin erie 
oe a ese 
© WV Or V Or 
gee te Pate, Wy yf 2 2 
aaa LEY ST Vee Yama) 
277 
Hence 
wt (1 - es 3 1) Ul Sry se yy ey ya 
Me ee 2 (9)!2 1,2 2 2 l te 
=F l—ry 5 + 72 (y?) + dr apa) eae Sostebe (xx): 
, [-¥ (23%? 
Again del) Bes [ € dz, 
Ns Zr / ys 
and Ys=y—ry +47. 
Nn. v ieee AY, —ta? 
Thus 1s — 14 | e dx. 
Ns NW On Jy-ry't4rty 
Take x =y— 2’, so that 2’ will be small, thus 
ry ary 
i pa, (1 + ya’ + dy?x'?) (1 — $7) da’ 
n, N Jo 
= = + 2(ry’ — tyr? + dryy’?) + ete. 
Sunilarly : 
Mog Ya 


pt = ag — 2 (ry’ — byt? + dry?) — ete, 


AT a aL Ae ee ar Za 


= = ' Py Py Uy ae VAL aye Ga) > A fPX ub ee 
NN; N2 ‘i N ry CY cts Z N 2 Yr (1 Yy )4 etc....(xxil). 
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Ng S (ny) (eo 
vs) — pas) =O ea | 
But s( aie yO Oy 
ee isto MN — ViVe ba 
and accordingly : S ( ay *) ae aa etc. 


We have then 


Z N?2 n,N /| 


2 , \ 
p (”) ae S (“ iD == ye tering ail 1)" eceecaueastes (xxiii). 


Again turning back to (xxi): 


1 2 
ayy Fad (ry APs yr) sata e yay. 


Square this, multiply by (xxii) and »,/N, sum and note that sum of odd powers of 
y go out. We find 


: 2 y 1+y?)2 ; : 
- S jae (( 7?) a x) | ae ( oe ) terms in 7?...(xxiv). 


Substituting (xxii) and (xxiv) in (xvii) we deduce 


(1 — »?)? y Dy Se Mi VY» 
2 ee ae — - — i {e 2 1 tep 2 
0, N q+ 4 GQ 2)2 * Nae + terms in 7 


(1 — 7?)? | wy% " 2p ae q : 2 2 
N NZ ow fal AE + terms in 7°]. 


. — 2 9 2 1 
ISiceletalla.TaPSvaye p= ee Sele A 


Vo eee 


This expression is of value for two reasons. First it indicates that the probable 
error of 7 found for non-associated variates is 


67449 (yyy Wy YB ( 

Sivan iNet ft) Gling. ee 5 

And secondly (xxv) may give a good approximate value of the probable error of 
n, when we neglect the last factor altogether, say for values of » under -4 or -5. 
Thus in the illustration given above 


M¥2 , 2 yt _ (-548,861 + 1.40889 31170 | 
Ee ON 4? 31170 (131170)? 


= 1-4208. 


We have therefore to multiply the probable error as given by the product-moment 
method, i.e. in our case -0147 by 1-4208, giving -0209, in fair accordance with the 
more accurate value -0226, it being remembered that a probable error is rarely 
worth more than one or two significant figures. 
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Table of Values of the Biserial n Functions, and "’. 


| 

« v | v" x v y" 

00 0 | 0 2-00 -82357 30-5075 

05 03132 | = -00393 2-05 “83084 34-9079 

‘10 06258 | -01577 2-10 -83770 39-9960 

“15 09371 03563 2-15 -84416 45-8899 

-20 12465 | 06375 2-20 “85025 52-7294 

25 15534-10043 2-25 85599 60-6805 

-30 18571 | -14608 2-30 -86140 69-9410 

*35 -21572 -20121 2-35 -86650 80-7466 

-40 24530 | -26644 2-40 -87132 93-3785 

“45 27441 | -34251 2-45 -87587 108-1734 

50 -30299 -43030 2-50 -88016 125-535 

“D5 -33099 -53083 2-55 -88422 145-946 

-60 -35839 -64531 2-60 “88807 169-991 

65 “38513 77509 2-65 ‘89171 198-369 

:70 -41118 -92177 2-70 89516 231-930 

‘75 43651 1-08716 2-75 -89843 271-696 

-80 -46110 1-27337 2-80 -90154 318-909 

85 -48493 1-48278 2-85 -90449 375-075 

90 -50797 1-71815 2-90 -90731 442-028 

95 -53022 1-98263 2-95 -90998 521-999 

1-00 55165 2-27983 3-00 91254. 617-713 

1-05 -57228 2-61391 3-05 91498 732-506 

1-10 -59208 2-98961 3-10 -91730 870-463 

1-15 -61108 3-41242 3-15 ‘91953 1036-599 

1-20 -62926 388863 3-20 -92166 1237-088 

1-25 64.665 4-42549 3-25 -92371 1479-538 

1:30 -66324 5-03136 3-30 -92567 1773-344 

1-35 -67906 5-71590 3°35 -92755 2130-136 

1-40 -69412 6-49026 3-40 92936 2564-332 

1-45 70844 | 7-36736 3-45 -93110 3093-849 | 
1-50 -722038 8-36218 3-50 -93277 3740-988 | 
1-55 73493 | 9-49211 3-55 -93438 4533-572 | 
1-60 "74715 | 10-77737 3-60 -93593 5506-376 . 
1-65 75871 | 12-24149 3-65 -93743 6702-955 
1:70 76965 | 13-91195 3-70 -93888 8177-972 
1-75 77999 | 15-82082 3°75 94027 10,000-169 . 
1-80 ‘78975 | 18-00564 3-80 94162 | 12,256-169 . 
1-85 -79896 | 20-51038 3-85 94292 15,055:358 | 
1-90 -80765 | 23-38668 3-90 94418 18,536-141 

1-95 81584 | 26-69522 3-95 94540 | 22,874-013 
2-00 ‘82357 | 30-50747 4-00 94658 | 28,291-951 


| 


A PRELIMINARY REPORT ON SOME BREEDING 
EXPERIMENTS WITH FOXGLOVES. 


By ERNEST WARREN, D.Sc. (Lonp.). 


In 1914 in the garden of the writer there occurred among some ordinary fox- 
gloves (Digitalis gloxiniaeflora) a specimen of the variety known as monstrosa in 
which there is a large, more or less regular, peloric or crown flower arising by the 
fusion of a variable number of ordinary flowers at the top of the axis. The idea 
arose that it would be of interest to ascertain whether this peloric character would 
blend in the offspring when the two forms of foxgloves were crossed, or whether, 
as would appear more likely, the character would be transmitted in its entirety 
or not at all. 

In working at this particular character it was found that the shape, the spotting, 
and the general intensity of colouration of the ordinary zygomorphic flowers varied 
greatly, and accordingly a record has been made of these different characters in 
the first generation. The second generation is now being raised; but, as it will be 
some time before flowering takes place, a preliminary report on the results of the 
observation on the first generation will be of interest, since they indicate the 
general trend of the experiment. 

I am much indebted to Dr Conrad Akerman for making most of the measure- 
ments and for ever-ready assistance in the experiment; to Mr A. W. Cullingworth, 
Mr W. G. Rump and Miss R. Meanwell for general help in the cultivation of the 
plants, mounting the pressed flowers, etc., and to Mr George Carter, Mr W. Bain- 
bridge, Mr C. R. C. McDonald and Mr H. C. Burnup for growing some of the plants 
in their gardens, also to Mr J. F. Nicolson for allowing me to examine and take 
specimens of plants from his private grounds. 


I. Material. 

The parent plants were the following: 

(1) Medium-purple crowned foxglove raised from seed from Erfurt, Germany 
= Al, 

(2) Very-pale-purple crowned foxglove from Mr George Carter’s Nursery, 
Pietermaritzburg, raised from Erfurt seed = C (1). 

(3) White ordinary foxglove (Carter) = C (2). 

(4) Light-purple ordinary foxglove (Carter) = C (3). 

(5) Medium-purple ordinary foxglove from Mr P. T. Badock’s garden, 
Hilton Rd = B (1). 
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(6) Pale-purple ordinary foxglove (Badock) = B (2). 
(7) 
(8) Dark-purple ordinary foxglove (Badock) = B (4). 
(9) Dark-purple crowned foxglove from Mr J. F. Nicolson’s garden, Boshoff 
Rd; seed from Messrs Dobbie, Edinburgh = N (1). 
(10) Dark-purple creased lip ordinary foxglove (Nicolson) = N (2). 


Medium-purple ordinary foxglove (Badock) = B (3). 


These plants were variously crossed or self-fertilized so that seed-capsules of 
some 36 different pairings were obtained. Unfortunately many of the families 
were completely killed soon after germination owing to scorching by the semi- 
tropical sun of Natal. There were ultimately obtained records of the flowering 
of 188 plants, the product of 19 families. In every case the contents of one capsule 
only were sown. Sowing was done in the middle of December, 1914, and flowering 
occurred in October and November of 1915. Most of the plants grew very 
luxuriantly and the flower-axes reached a height of 4 to 5 ft. 


The following is a list of the families showing the parentage (seed-plant placed 
first) and the number of plants which flowered. Except where indicated the pollen 
was taken from the ordinary zygomorphic flowers: 


[Ta A (self-fertilized)... 35 XI A x C\(3) eae 
I C (1) (self-fertilized) 7 XI 1 Ci(3)exeAe 1 
HW Ca) x sBi(4) ie ers XV A xeCx(2) 14 
III B(4) x C(1) sheet) Ve, (2) A 8 
IV B (A) (self-fertilized) 5 XVa Ax C(2) 7 
Ve 2 C.() x BIG)) pases XVB C(2)x At 1 

VIII C(1) x C (2) nee. M0) XXIV Ax B(4) 13 
TX.) “Cy2) x C1) sae JD) XXV_ B(4)x Aft 6 

IXa C(1) x N (2) ay ae XXIX Ax N (1) e) 
X  C (2) (self-fertilized) 11 


All the families except I a were grown in beds on a piece of land close to the 
Natal Museum. Ia plants were raised in different private gardens of Pieter- 
maritzburg. 


Il. The Inheritance of certain Characters. 


The characters dealt with are: 

(1) The possession of a crown or a terminal peloric flower at the top of the 
flower-axis. 

(2) The presence of brown spots in place of dark-purple spots on the lower 
inside surface of the ordinary zygomorphic flower. 

(3) The amount of spotting on the lower inside surface of the ordinary flower, 
expressed as the percentage of area of this surface covered by the spots. 


* Pollen from crown flower. 
+ Pollen from terminal crown flower of side flower-axis. 
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(4) The intensity of the purple colouration over the general outer surface 
of the ordinary flower. 


(5) The ratio of the maximum width of the corolla at the mouth to the total 
mden x 1000. 
length 

With reference to the spotting on the inside of the flower the percentage area 
covered by the spots was estimated by the help of a diagram consisting of a series 
of equal sized rectangles each of which was spotted to a known extent: 4%, 8%, 
MT a Os Opn tes 64%. 


For estimating the intensity of the purple colouration over the general outer 
surface a colour scale was obtained in the following way. A very weak solution* 
of a mixture of aniline dyes was prepared which resembled the purple colour of 
the flowers. This solution was placed in upright rectangular glass jars of varying 
internal dimensions. With transmitted light the intensity of colour, of course, 
varied according to the depth of fluid. Jars of internal dimensions, measured 
from back to front, of 20 mm., 30 mm., 40 mm., 50 mm., 60 mm., 70 mm., 80 mm., 
90 mm., 100 mm., 140 mm., 280 mm., were employed, and the colour of the solu- 
tion, as viewed from the front of the jar with light coming from behind, was com- 
pared with the general colour of the flowers. Thus, if the colour of the flower was 
nearest, say, to the colour of the solution in a jar with a thickness of fluid of 70 mm. 
the colour of the plant was said to be 70. For a comparatively small range, as 
from 20 to 100 mm., the method appeared to work satisfactorily, but less reliance 
can be placed on the greater depths. In any case a scale of some definite physical 
significance is obtained. 


length of the corolla, expressed 


The colour of the flowers is due to the presence of a purple cell-sap. In the 
case of the general colouration of the whole surface both the outside and the inside 
epidermal cells bear this coloured sap. In the spots, where the colouration is much 
more intense, the coloured cells are confined to the epidermis on the inside of the 
flower. In flowers which are white, with the exception of the deeply coloured 
spots, there may be some exceedingly faint traces of isolated streaks of purple on 
the inside of the flower, these are due to a slight colouration of the inside epidermal 
cells. 

The flowers were measured with compasses and scale. The width was the 
maximum distance between the reflexed margins of the mouth, measured from 
right to left, and the length, measured along the adcauline surface, was the distance 


from the middle of the straightened-out margin of the mouth to the middle of the 
insertion-edge of the corolla. 


1. The peloric Character. 


The peloric flower at the top of the axis would appear to arise by the fusion of 
a varying number of the uppermost flowers and the concurrent entire suppression 
of the internodes between such flowers. Usually the apex of the main axis is un- 


* Water 100 c.c., rubin 2-375 mg., gentian violet 0-125 mg. (Griibler), 
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affected, and after the peloric flower has withered it continues to grow through 
the centre of the crown and may produce a considerable number of zygomorphic 
flowers. On this out-growing axis a second and subsequently even a third peloric 
or crown flower may be produced. These are always less perfect than the first. 
In other plants the peloric flower is permanently the terminal flower, and the 
main axis ends in ovary and pistil. 


In plant A the peloric flower consisted of four fused flowers, in plant C (1) 
of about eight flowers, and in both cases the main axis continued to grow on the 
withering of the crown. In N (1) the peloric flower was a regular deep cup consisting 
of only three fused flowers and the main axis terminated in an ovary. 


In the case of side flower-shoots the axis seldom continues to grow, but 
generally terminates in a pistil. In plant A the terminal flower of the side shoot 
consisted of only two flowers which had fused laterally and the resulting structure 
was bilaterally symmetrical. In plant C(1) the side shoots terminated in a 
beautifully regular shallow cup consisting of three fused flowers. 


In connection with the formation of the crown flower it is clear from the arrange- 
ment of the stamens, and the lobing and spotting of the corolla, that the structure 
arises as if the flowers had split along the middle line of the upper surface, spread 
out and fused with their neighbours along the split edges. 


As an abnormality there occasionally occurred among the plants zygomorphic 
flowers completely split along the upper surface: they were otherwise quite normal 
flowers. Sometimes where split the edges gaped apart, and thus there is a 
tendency for the flowers to arise in a flat, leaf-like condition. 


It may be supposed in fact that the complete suppression of certain inter- 
nodes causes a definite number of flower-buds to fuse together in such a manner 
that when growth occurs the resulting structure has the appearance of being 
produced by the joining together of several zygomorphic flowers which have been 
cut open along the middle line of the upper surface and have been connected 
together to form a cup or shallow saucer. 


It was soon found that imperfect crowns of different degrees of incompleteness 
could occur, and this fact raised the presumption that in the cross of a crowned 
plant and an ordinary plant the offspring would exhibit more or less intermediate 
characters. 


It seemed quite hopeless to attempt to find a natural scale by means of which 
the intensity of the peloric character in different plants could be measured, and the 
following empirical grading was adopted. 


(1) Plants in which the main axis carried a perfect crown of a variable number 
of flowers (3, 4, 8) was placed in the 100 °%% grade. On some few plants the corolla 
of the crown flowers was quite small and green, or even entirely absent, but the 
arrangement of the sepals indicated clearly that the peloric character was fully 
developed; such plants were also placed under the 100 % grade. 
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(2) Sometimes the main axis had its terminal portion spirally twisted in 1-2 
turns, and in this region the flowers were excessively crowded owing to the great 
reduction in the length of the internodes. The flowers were very abnormal, and were 
usually split open and formed large, flat, coloured, petaloid structures which matured 
long before the lower normal flowers of the axis. The side shoots of such plants 


usually carried more or less perfect crowns. These plants were placed in the 
75 % grade. 


(3) Plants in which the main axis was ordinary and carried no crown, but the 
side flower-axes terminated in crowns, consisting of two or more fused flowers, 
were placed in the 50 % grade. 


(4) Plants in which the main axis was markedly dwarfed, lumpy and curved, 
owing to the shortening of the internodes, but the flowers more or less normal, were 
placed in the 50 % grade. 

(5) Plants in which the terminal portion of the main axis was somewhat 
curved, in which the upper internodes were reduced and of which the side shoots 
also showed such a tendency, were placed in the 25 % grade. 

Spiral bending of the axis always occurs with the peloric condition when the 
crown consists of more than three flowers and the axis continues to grow through 
the crown. It is doubtless caused by the suppression of the internodes. 


In the following table the results obtained in the first generation are summarized. 


Offspring 
Parentage Crowned or tendency for same 
No. | | | | Not 1 
100 % | 75 % 50 | Pies LE eS a 
es | crowned 
_ : —- 
Ia. A (crowned 100%)? x gS (self) ...| 33 | 38 | 0 | 0 | O 33 0 
I. C (1) (crowned 100 %) )@ x g (self) ... 7 3 0 | 4] 0 7 0 
X. C (2) (ordinary) 2 x ¢ (self) ... cer | med all ae i 0 3 8 
IV. B(A4) (ordinary) 2 x ¢ (self ~ 5 OV O10 0 0 5 
VIII. -C (1) (crowned) ae x C(2) (ordinary) ‘3 | 10 2 0 Ty @ 3 7 
IX. (C (2) (ordinary) 9 x C (1) (crowned) 2 10 5 0 1 6 4 
II. (€ (1) (crowned) 2 x 5 (4) (ordinary) | 8 0 OF 0) a0 0 8 
( IIT. B (4) (ordinary) 2 x C (1) (crowned) |) 5 OF) 0 0 0 0 5 
V. C'(1) (crowned) 2 x B (8) (ordinary) 3 | 23 0 0 Oo | 0 0 23 
XI. A (crowned) 9 x C (3) (ordinary) 3.. 17 2 4 BP dl 10 7 
| XII. C (3) (ordinary) 2 x A (crowned) 3 .. 1 0 1 0) 0 1 0 
a A (crowned) 2 x C (2) eo lace 14. 7 2 4Or. | 50 9 5 
XV. C (2) (ordinary) 2 x A (crowned) g ... 8 2 OR eal 0 3 5 
XVa. A (crowned) 2 x C (2) (ordinary) ¢ ... 7 2 0 0 0 2 5 
XVzs. C (2) (ordinary) 2 x A (crowned)  ... 1 to) 0 Ont ial 1 0 
IXa. C (1) (crowned) @ x WN (2) (ordinary) 3 fad! 1 0 0 | 0 1 3 
XXIX. A (crowned) 2 x N (1) (crowned) g .. 4 4 0 OPO 4 0 
XXIV. A (crowned) @ x B (4) (ordinary) 3 13 0 0 0 | 0 0 13 
XXV. B(A4) ee . x A ( ae ey 3 6 0) 0 OO 0 6 
Totals 83 |) «104 
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It will be of interest to compare these figures with those that would be expected 
if the character behaved as a Mendelian unit. In the experimental results it will 
be noticed that all plants showing any tendency for a crown are grouped together 
as opposed to the plants exhibiting no such tendency. 


To obtain the theoretical figures, and speaking in Mendelian terminology, it 
will be necessary to make certain assumptions as to the homozygous or heterozygous 
nature of the parent plants with regard to the presence or absence of a crown. 
The absence is dominant over the presence of a crown. 


In the cases of parents A, C'(1), C (2) and B(4) we have experimental evidence 
of the gametic nature of the plants. Reference to the above table will show that 
A and C'(1) are pure recessives (families I a and I), C (2) is heterozygous (family X), 
and B(4) almost certainly homozygous dominant (family IV). For the other 
parents we assume that B(3) and B(4) are homozygous dominant, N (1) pure 
recessive, and C(3) and WN (2) heterozygous. 


With these assumptions, which are warranted by the behaviour of the parents 
in the various crossings, we can calculate the theoretical expectation for each 
family. The totals, to the nearest whole figure, are 83 plants crowned and 
104 plants not crowned. The experimental result as shown above gives the 
same figures. In the light of other investigations this identity between the 
theoretical expectation and experimental result is to be regarded, of course, as 
unusual and accidental. 


Of the 83 plants with a crown or a tendency for a crown the number in each 
grade is given below: 


| 100% 15 %, 50 % | 25 %, Total 


| 
62 | 8 Lira a2 83 


| 


In the various crossings of crowned parents and non-crowned parents there is 
thus a marked tendency for the character to be transmitted in its entirety or not 
at all. Notwithstanding this fact it nevertheless appears that the character can 
be diluted ; for example in cross no. XI, A (crowned) 2 x C (3) (ordinary) 3, we have: 


Crowned 
= ——— —_— ae — 7 Not | 
| ie crowned 
100 % | LO, 50% 25% 
2 4 | 3 1 7 taal 


Thus, although the character appears to be capable of blending yet it is usually 
transmitted as a simple Mendelian unit. 


— eT ee Oe ee ee 
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2. The Presence of Brown Spots. 


Among the various families there occurred plants with white corolla which 
possessed brown or greenish spots in place of the usual dark-purple spots. Such 
brown-spotted flowers only occurred when the corolla was white. If there was 
the faintest trace of purple on any part of the general surface of the corolla the 
spots were almost invariably purple. On the other hand the intensity of the 
colouration of the spots appeared to be in no way related to the intensity of the 
general colouration of the corolla, since some of the darkest coloured spots were 
found in flowers with an otherwise white corolla. In the terminology of the 
Mendelians it may be said that distinct factors control the general colouration of 
the corolla and the colour of the spots. 


In this connection it is interesting to remember that the general colouration 
of the corolla is due to coloured sap in the cells of both the outer and inner epidermis 
of the flower, while the colouration of the spots arises through the colour in the inner 
epidermal cells only. 


A microscopical examination of the brown spots revealed the presence of a 
granular brownish substance and of a great accumulation of typical starch grains. 
Such starch grains were few or absent in the surrounding epidermal cells; also 
they did not occur in the normal dark purple or crimson spots. The brown sub- 
stance is almost certainly a decomposition product of anthocyanin, the normal 
colouring matter. Dr F. W. Bews, Professor of Botany, Natal University College, 
has suggested that possibly the starch also is to be regarded as a decomposition 
product of the anthocyanin: but the great abundance of the starch somewhat 
militates against this view. 


The brown substance is insoluble in alcohol, ether, HCl (50°) and potash 
(1 %)- 

With the naked eye there is a sharp discontinuity between white flowers with 
purple spots and white flowers with brown spots. Microscopical examination, 
however, shows that the discontinuity is not really so sharply defined. In many 
of the coloured cells of the purple spots traces of the brown substance may be 
detected, and at first it was thought possible that the removal of the purple colouring 
substance with alcohol would leave brown spots. This, however, was found not 
to be the case. Only rarely is the brown substance in sufficient quantity to mark 
out the spot on the removal of the anthocyanin. The point to notice is that brownish 
purple spots intermediate between the normal purple spots and the brown spots 
are only rarely found. Traces of an intermediate condition can be found micro- 
scopically, but there appears to be normally a gap between a small amount of 
decomposition product and total decomposition. 


It will be remembered that there occurs a similar gap between ordinary foxgloves 
and crowned foxgloves; intermediate conditions are possible but only rarely 
occur. 
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If we assume that absence of decomposition (purple spots) is dominant and 
total decomposition (brown spots) recessive, and that the parents which produced 
white offspring were heterozygous dominants with respect to this character, we 
can compare the experimental result with the theoretical Mendelian expectation. 


| | 
| Offspring 
(He ieee ts) Observation Theoretical expectation 
Purple spots Brown spots Purple spots | Brown spots 

X. CC (2) (self-fertilized) 9 2 8-25 2:75 
Ia. Ai (seif-fertilized) ... 26 7 24:75 8:25 
XVa. (AQ x C(2)g of 6 1 | 5-25 1-75 
XVB. |(C(2)2x AS 0 1 0-75 0-25 
| XE .f49x« C(3)s 12 5) 12-75 4:25 
XII. \(C(3)9xAs 1 0 0-75 0-25 
XIV. {AQ@x C(2)3 9 5 10-50 3-50 
XV. \|C(2)9xAS | 6 2 6-00 2-00 
Totals 69 23 69-00 23-00 


The assumption as to the heterozygous nature of the parents is warranted by 
the consistency which is shown when a supposed heterozygous dominant is crossed 
with an undoubted homozygous dominant. Thus we have: 


| Offspring 
Nl 
: ; | vee. Theoretical 
Parentage | Observation expectation 


Purple eee Purple | Brown 


C (2) Heterozygous dominant 2 x C (1) IO) ile 0) 10 0 


homozygous dominant 3 | 
C (1) Homozygous dominant 2 x C (2) 10 | ) 10 0 


heterozygous dominant 3 


Totals ... ae ae NY AOE salle OY St SKA) 0 


The fact that the observed result and the theoretical expectation in the above 
two tables should be identical may be regarded as somewhat accidental, but never- 
theless the agreement is rather striking. 


3. The Amount of Spotting on the inner Surface of the Flower. 


By means of the diagram already referred to the percentage area, with respect 
to the lower portion of the inside surface of the flower, covered by the spots was esti- 
mated in six ordinary zygomorphic flowers in each plant. This character appeared — 
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to exhibit no discontinuity in its inheritance. All intermediate conditions seemed 
to occur in the offspring; for example a parent with much spotting crossed with 
a parent with few spots tended to have offspring with a medium amount of spotting. 
It was quite impossible to sort out the parents or offspring into two or more groups 
of “much spotting,” “little spotting,” “medium spotting,” etc. As far as can be 
judged from the material at hand the units of spotting, if such occur, are so small 
that the series is practically a continuous one. 


The only satisfactory way of dealing with such a character which exhibits 
no discontinuity is the statistical method of Galton and Pearson. From the nature 
of the crossings it may not be always possible to deal statistically with the relation- 
ship of both male and female parent to the offspring. For example, if a single plant 
be crossed with the pollen of a dozen different plants, and families raised, there 
will be one female parent only, but 12 different male parents and twelve different 
mid-parents. Such a method of procedure is not free from objection, but it is the 
only course that is possible with the material at present available. In each plant 
the amount of spotting was determined in the bottom six flowers of the main 
axis, that is in the first six flowers which opened. 


Another point to note is that some of the families were raised from seed obtained 
by self-fertilization. Now, the differences, if any, in the strength of inheritance 
in parthenogenesis, self-fertilization and cross-fertilization have not yet been fully 
investigated from an experimental point of view, and accordingly the heredity 
constants have been calculated both with the inclusion and the exclusion of the 
self-fertilized families. 


Ordinary correlation tables have been made, utilizing the six flowers of each 
plant, and Prof. Pearson’s constants have been calculated. 


As an example the following table is given: 


Spotting : Mid-Parent and Offspring (omitting self-fertilized Plants). 


Mid-parent, | | | 
percentage 54 | 96 189 144 105 91 45 31 12 6 | Totals 
of spotting | | | 
é—I1 34 36 36 2 — — — — — | — 108 
12—165 20 43 72 39 u | 181 
16—19 — Saul Oo 55 47 42 28 24 t 2 258 
20—23 = — = 4 5 2 1 — = -- 12 
24—27 = By | eae 39 18 13 u — —— — 113 
28—s1 = 5 1 5 28 29 2 2 4 | 2 78 
32—35 — — = = = 5 7 5 4 | 2 23 
sb & a a) a> 9 ‘ 
| gS o5 i I ~N iS) my 55 33 33 
a g i 8 AS ‘ xX Ns) S } 3 4 il EG 
3 3 a <s) Ni S R n | 1 89 oD | 
{ 
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The general results may be summarized in the form of a table: 


| No. of | No. of | 
* | mid- off- 


iparents| spring | 


| No. of 
| flowers 


Mid-parent and offspring (including 
self-fertilized families) 

Mid-parent and offspring (excluding 
self-fertilized families) 

Male parent and offspring (including 
self-fertilized families) 

Male parent and offspring (excluding 
self-fertilized families) 


be iisanl =ei76 

lL 127 
| tea 1 a6 
|u| aa7 


Standard | Standard | Gocfiient 
oe : Snes correlation 
| 10509 | 8500 | .-563 
6-610 7-790 | 545 
16-733 | 8-595 | -431 
15-249 | 7-790 |  -820 
| 


With the small number of parents available the constants are bound to be 
erratic, and the results can only be regarded as giving a rough indication of the 
strength of inheritance. The general tendency is for a low parental correlation: 
the mean of the two paternal correlations is *38. The average of some 52 correla- 
tion coefficients of parental heredity obtained from a variety of different animal 
species with various characters has been given by Prof. Karl Pearson as -45, while 
the mean of the parental coefficients for a series of characters in the Shirley Poppy 


was ‘33. 


4. General Colouration of Corolla. 


We have seen that the general colouration of the corolla is due to coloured 
sap in the epidermal cells of both the inside and outside of the corolla. The 
intensity of this colouration was estimated, as already described, by a colour-scale 
obtained by viewing by transmitted light varying depths of a standard tinted 


solution. 


There is discontinuity in the colour of the flowers of the plants. The 


flowers can be divided into (1) purple and (2) not purple (white). For this sub- 
division a purple flower must be defined as one in which the general surface of the 
corolla is coloured. The colour may be quite pale or very dark. A white flower 
is one in which the general surface is white, but there may be a few extremely 
faint blushes of purple on the corolla, especially on the inner surface. 


Such definitions may seem empirical and unsatisfactory, but nevertheless 
it was found in practice to work quite easily owing to the fact that exceedingly 
faint general colouration of the whole corolla did not occur. 
gap in the series between faintly “purple flowers” and the “white flowers.” In 
the whole series the palest purple flower was represented by a depth of 15 mm. of 
coloured fluid and there was only one such plant. 


There was a clear 


In the accompanying table the families are arranged in the order of the mid- 
parental colour—the mean of the colours of the male and female parents. 
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By reference to the horizontal line of totals at the bottom of the table it will 
be seen that among the purple offspring there are none under the first three units 
of the colour scale, 0-4, 5-9, 10-14. There would also. appear to be discontinuity 
at the other end of the scale, but it is very doubtful as to how far reliance can be 
placed on the colour-scale where the depth of the fluid is considerable. 


Taking the families altogether there were 139 purple plants and 41 white. 
The Mendelian expectation may be calculated for each family on making the 
following assumptions: purple colour dominant to white; C (2) pure recessive; 
A, C(1) and C(3) heterozygous dominants; B(3), B(4), N (1) and N (2) homo- 
zygous dominants. 


Justification for these assumptions is given by the consistency with which 
the different parents act in different pairings. The sum totals of the calculated 
expectations for all the families are 131 purple and 49 white. It will be seen that 
these figures are in very fair agreement with the experimental result which is 
139 purple and 41 white. This may be regarded as a typical Mendelian result. 


A point of interest is to ascertain as to how far the various intensities of the 
colouration among the purple plants are inherited. It is possible that some 
observers would regard the different intensities of the purple colouration in a 
family as representing mere fluctuating variations which could not be inherited. 
According to the pure-line theory the different intensities of purple colouration 
occurring within the limits of a family resulting from self-fertilization cannot be 
inherited. It will be possible to test this theory in the generation of plants 
now being raised, but the present material cannot give a definite reply to the 
question. 


An inspection of the above table (p. 313) will show that as we pass to the higher 
mid-parental colours there is a gradual rise in the colouration of the families. 
This implies that the various shades of purple are not distinct types, but that 
they can blend perfectly with one another. 


The fact that in closely similar matings the families raised show a regular 
progressive rise in the intensity of colouration with the rise in the mid-parental 
colouration does not favour the expectation that any support for the pure-line 
theory will be forthcoming from the present investigation. 


It is desired to find the correlation with respect to the intensity of colour 
between the purple parents and the purple offspring. or this purpose, in order to 
have strictly homogeneous material, the parents should be homozygous dominants. 
The material will not permit such a calculation. The purple plants of the different 
families include both homozygous dominants and heterozygous dominants. Now 
the homozygous dominants are found to be darker in colour than the heterozygous 
dominants, and so in any family consisting of an equal number of heterozygous 
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and homozygous dominants we may approximately divide the individuals into the 
two groups by regarding the darker half as homozygous and the lighter half as 
heterozygous dominant. In this way we can form a correlation table between 
the parents and the heterozygous or the homozygous offspring. 


Using the symbols introduced by Prof. Bateson, namely RR homozygous 
recessive, DR heterozygous dominant, DD homozygous dominant, we have, on 
crossing individuals of the different gametic constitutions, the following theoretical 
results : 


RR x DR=1 DR (light purple) + 1 RR (white), 
DR x DR =1 DD (dark purple) + 2 DR (light purple) + RR (white), 
DR x DD =1 DD (dark purple) + 1 DR (light purple). 


In calculating the correlation it is advisable to use only parents and offspring 
of similar gametic constitution; thus, DR parents with DR oflspring, or DD 
parents and DD offspring. RR parents and RR offspring being white are 
excluded. 


To accomplish this it is necessary in any given family, obtained by crossing 
together RR, DR and DD parents, to sort out the purple offspring into DR and 
DD individuals. To do this with complete certainty it would be necessary to self- 
fertilize each individual and obtain families. The labour involved is too great 
to be undertaken lightly. Owing to the fact that DR ofispring are paler than 
DD ofispring we may approximately divide the families into the two groups 
of individuals by taking the darker half (in the case of DR and DD parentage) 
or the darker one-third (in the case of DR and DR parentage) of the plants 
as being DD individuals in constitution and the remainder as being DR 
individuals. 


Thus for example: 


| Parentage Family under Colour-Scale (see preceding table) 


| its | 

| Vill (1) x C (2) = DR x REF | 1, 2, 4 (DR), RR (excluded from table) 

| I. C(1) x C(1) = DR x DR|1, 0, 1, 1, 2, 1 (4 DD and 2 DR) = 1,0, 1 DD + 1, 2,1 DR 
(IXa (1) x NV (2) = DR x DD | 1, 0,3 (4 DD and $ DR) = 1,0,1DD+2 DR | 


In this way a correlation table can be made with fair accuracy between DR 
parent and DR offspring. The paucity of the material, however, prevents much 
reliance being placed on the actual figures obtained. At most the figures can give 
only a rough indication of the strength of inheritance. 


When a heterozygous purple plant is crossed with a white plant (recessive) 
both purple and white offspring are produced. The coloured offspring, however, 
tend to be paler than the coloured parent. Thus the colour of the coloured 
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DR Female Parent and DR Offspring. 


Female 
eel 2.1 °6 | 8 98 | 8 48uiae |e! b | 12 } 8) 2] 1 | 1 | Totals 
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30—34 —/|4 oy | 25s 3 2 8} 4)/—};—}]—]—]—]|— 29 
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DR Male Parent and DR Offspring. 
Male parent, | | | 
general 1 2 3 6 5 2 a1 3 1 5 6 7 1 1 | Totals 
colouration | | | | | 
| | 
30—34 — | — | 2a! se) aoe o. 1820 20 +1 om aie an a 
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parent has been diluted by crossing with the colourless parent. Since such 


blending occurs it will be of interest to find the correlation between the mid- 
parental colour and the colour of the DR offspring. The DD and DR offspring 
were separated from one another by the method above described. 
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Mid-Parent and DR Offspring (including self-fertilized Families). 
l | 
Mid-parent, | | | | | | 
general Tou non eomlelg 4. \ 3 }10| 6 | 9 |12/ 9] 3 | 2) 1 | Totals! 
colouration | | | | | | 
| 
lane ea : re | 
emerge ae ege Say ieee toe oe P|) he Ih ge 
20—24 | — | 0 
25—29 | | | —)—)—|— 0 
30—34 sod leeee al oa al) | | 4 
35—39 1 2 3} 3} ||) all OF Ont 3 5} 1 }—]—]— 21 
40—44 | | | — 0 
45—49 | (e | 0 
50—654. tae ae ee 4/4] 2)|—|— la} 14 
55—59 See es ae en) ON SO aL eO ON On) Te) 7 
60—64 SLO | 4) 0; 0,0) 1] Oh 1) 2)—)|—)—|— 9 
65—69 —)}— | | | | 0 
70—74 = te RO an set Gel, 40 ein) 15 | 
75—79 hens =) Tella Cone Oe Da) 2 Toe 
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b= § Saiwlialwxialxtis oie eee > | | 
STG TIT TTT 
#h2 (S/R RF SPSS Sissies sss is,sx, | 
: | | ie ean | | 
The general results may be summarized as follows: 
] ] | 
No. of | No.of] qa. 4.4 | Standard | Co- 
Relationehi mid- DR | pended deviation | efficient 
tie Aen ee parents off- | Ee aen te _ of DR _ |of corre- 
| or parents spring | a ae offspring | lation 
Mid-parent colour and colour of DR offspring 17 92 | 19-280 | 17-520 | -675 
(including self-fertilized families) | | 
Mid-parent colour and colour of DR offspring | 1155 74 18-514 17-800 587 
(excluding self-fertilized families) 
Male parent (RD) colour and colour of DR off- 8 44 17-249 18-336 471 
spring (including self-fertilized families) 
Female parent (DR) colour and colour of DR 12 | 76 19-351 16-563 ‘415 


offspring (including self-fertilized families) | 


We see from the above that the strength of inheritance, with respect to the 
intensity of colouration, between individuals of similar gametic constitution 
follows a similar statistical law to that exhibited by characters where discontinuity 
cannot be detected, where complete blending occurs and no trace of segregation 
can be found. 


It might have been anticipated by some that the varying intensities of the 
general colouration among individuals of similar gametic constitution were casual, 
fluctuating variations which could not be inherited. The above results point to 
the view that small variations of this nature are inheritable, and consequently 
they can be of importance in the evolution of species. 


318 


5. 


Breeding Experiments with Foagloves 


The Ratio of the Breadth to the Length of the Corolla. 


The parent plants differed markedly in the relative width of the corolla. C (1), 
C (2) and N (1) were wide, B (1), B (3) and B (4) were narrow, and the remainder 
were of medium width. As in the case of the amount of spotting of the corolla, 
no distinct discontinuity in the variation of the character could be noticed either 
As far as could be judged the variation was a con- 


in the parents or offspring. 


tinuous one, or one with such small steps as practically to be equivalent to the 


same. 


The following tables of correlation are given as records: 


Ratio, Mid-parent and Offspring (excluding self-fertilized Families). 


Mid-parent. | | | | | | he om abe | 
Breadth | jo eA 2833 Onea taeda 80 | 102 | 103.80 61 | 45 | 30 | 31 Oo Tah Oa iad 3/1 Totals 
Length | | | | ee | 
r * er | | 
503—516 fey De Ge lean 16| 14] 15 TY | 7g) 3) al sl lll 114 
517630 |— 2 | 3] 8) 0) LikOkigie we) 1 ee ere 
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587—600 a LV) 7) S610) | 07) 95) 1G Se 18s 4s GS eae Ge a i |e) 
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BS | bp Na alee el tid laidialalsiaigleisiae 
og|s VISIV SSIS IS Bl Ss; Ss srs sl S SiIsSls/slsi isis 
ial | | | | | 
| | | | | | 
Ratio, Male parent and Offspring (including self-fertilized Famalies). 
] r | | | 
| Male parent. | | 
Breadth 1 | 9 | 27| 58 | 86 | 121 | 145 | 164 | 147 | 109 | 74| 28] 9 | 16] 5 | 1 | Totals 
———- 1000 
Length 
| 
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520—5359 1 | 4 | 15 | 25 | 23 | 30 48 | 53 48 29 | 13 ye 74 Bei) |) 298 
540—559 | — Ff — | — | — | — | —  — } — | -— | KH 0 
560—579 — | 2 4/} 8/13] 36 BEB Sil 35 | 25 | 28°) 18) 6 107 2a 252 
580—599 | — —}|—]/— 0) 
600—619 —|—|]— 3) 47) 13 15 27 25 32 | 25 4) 1 o | 3 156 
2 | | 
Ue Mel tle lle et I | | 1000 
G'S | Bo Sis iS isisisiers 18 | 8 Ss i Naesaleees 
Oo 2/8 ; | si SS Solas iD ww | % |S | SS |S |S Tico 
fA IH | | | | 
| 


ERNEST WARREN 319 


| ees | | Standard | Standard | 
; E No. of | No. of | No. of devia- deviation | Coefficient 
Relationship parents | off- flowers tion of of | of — 
spring parents offspring | correlation | 
Mid-parent and offspring (in- 15 176 | 1000 29-626 49-220 554 
cluding self-fertilized families) | | 
Mid-parent and offspring (ex- 1 127 7174. 25-886 47-164. “509 
cluding self-fertilized families) 
Male parent and offspring (in- Hose 76 1000 38-783 | 49-236 341 
| cluding self-fertilized families) | | 
| Male parent and offspring (ex- 1] 127 774 38-898 | 47-358 “300 
cluding self-fertilized families) | | 
| } 


The parental correlation is thus low as in the case of the area covered by spots. 
Such a condition increases the probable error of the numerical results, and many 
more parents would be necessary in order to obtain more reliable figures. Never- 
theless it may be noted that the statistical results obtained with the various 
characters are very fairly consistent with one another and the parental inherit- 
ance of the characters investigated tends to be somewhat low. 


IlI.. The Variability of the Race and of the Families. 


According to the pure-line theory the variability within a family consisting 
of individuals of similar gametic constitution and arising by self-fertilization of 
the parent should be small, supposing that no new mutation should arise, and 
it is of such a nature that it is incapable of being transmitted by inheritance. 
Selection within such a family would not be effective. In other words, families 
obtained by self-fertilizing individuals at the two ends of the range of variation 
would exhibit the same mean. The material at present to hand will not throw 
direct light on the question, but indirectly it lends no support to the theory. 


The comparative variabilities of the race and of the individual families are 
given in the table on p. 320. The colour of the different flowers on an axis 
was found to be essentially uniform; but in the two other characters there was 
some variability among the flowers on the same axis. Therefore in these two 
characters (the area covered by spots, and the ratio of the width to the length 
of the corolla) the mean of the six basal flowers of the axis was taken as expressive 
of the nature of the plant. 


Thus the mean variability of a family is 67 % of the variability of the whole 
race. Owing to the fact that the majority of these families arose through cross- 
fertilization this large family variability is not a proof against the pure-line 
theory. It may be stated here that with the four self-fertilized families raised 
there was some tendency for a reduction in the variability, but nevertheless it 
remained very considerable. 


The average percentage variability of the self-fertilized families is 50 % of 
the variability of the race. This is to be compared with 67 % for cross-fertilized 
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families. The number of the families is, however, too few for drawing any absolute 
conclusions. With such a high variability in these families it would appear im- 
probable that the families obtained by self-fertilizing the individuals at the 
extremes of the range would both exhibit the same mean. It may be added that 
in the parthenogenetic aphis (Hyalopterus) it has been shown that the variability 
of the family amounted to 60 % of that of the race. 


l 
| | No. of Mean 
_ No. of — Standard families No. of standard |S.D.fam. 
(Ghancten individuals | deviation | with six |. ae da js | deviation | 8.D. gen. 
in genera- | (S.D.) of or more smprihdmeaieags ||,8((S4)D))) Gi x 100 
tion | generation | individuals the families 
Intensity of general 90 3-490 8 69 2-349 67 
colouration (DR in- | | 
dividuals) | 
Ratio Breadth rola | 172 3-324 13 154 2-414 73 
Length 
Spotting 178 1-816 13 160 1-000 61 
| Mean of percentages 67 


It must be remembered that Johanessen’s experiments have been shown by 
Karl Pearson to be inconclusive with reference to the pure-line theory. The size 
of the different seeds produced by any given individual plant varies greatly, and there 
is no inherent reason to suppose that the offspring obtained from the small seeds 
and large seeds taken from the same plant would exhibit any difference in the 
size of their seeds. It might reasonably be supposed that a small seed and a large 
seed from the same plant would both inherit the average character of the parent 
irrespective of their actual size. One might as well expect that the plants obtained 
by budding from a small begonia leaf would have smaller leaves than those obtained 
from a larger leaf of the same plant. 


IV. Fraternal Correlation. 


The likeness of brothers to one another has been investigated, and the relation- 
ship was studied after the method adopted by Prof. Pearson. He takes each 
individual of a family and compares its deviation from the mean with that of every 
other member of the family. These combinations can be plotted out in a correla- 
tion table in a similar manner to the relationship of offspring and parent. The 
coefficient of correlation for all possible pairs of brethren will be a measure of their 
resemblance, just as the standard deviation is a measure of their dissimilarity. 


The fraternal correlation was calculated for (1) the intensity of the general 
colouration of the purple flowers, (2) the area of the spots, (3) the ratio of the 
width to the length of the corolla. 


For the intensity of the general colouration only offspring having a similar 
gametic constitution were taken. Heterozygous individuals, or those which are 
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supposed to produce two kinds of gametes, were used. With regard to the ratio 
and the area covered by the spots the nature of the individual was determined 
by taking the mean of the varying values obtained from six different flowers from 
the flowering axis. This procedure was unnecessary in the case of the colour, 
as the individual flowers on the same axis did not differ from one another per- 
ceptibly. 


As a record the fraternal correlation table for spotting is herewith given: 


| { 
Percentage | | | | 
spotting, 281 | 378 | 425 | 596 | 300 | 168 | 447 |. 21 16 | 4 Totals 
Ist brother 
| | 
0—3 64 98 70 | 47 2} | == = 281 
4—7 98 138 73 65 4 — — = — — | 378 

8—I11 70 Vi} 124 9] 34 20 8 1 4 — 425 
12—15 47 65 91 178 107 | 53 | 48 5 2 — 596 
16—19 2 4 34 107 64 49 30 8 My — 300 
20—23 = = 20 | 53 49 | 20 19 4 2 1 168 
24—27 — — 8 48 SO CLO o6r nee 4 1 | 147 
28—31 — = 1 5 ae eee On ee | 0 I yo} 6 oy 
32—35 — — 4 2 2, 2 4 1 0 1 16 
36—39 ; o— at ey ee ae 1 1] — 4 
on | : | | 

ee | | | | | 
aes oR 7 i | | EN | 23360 
Rs) 2 RS S > co | & So 
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The comparative results obtained for parental correlation and fraternal correla- 
tion are shown in the following table: 


| Parental | Fraternal 
Character | < i No. of | N iy = aa _ 
No. 0 No. 0 fee | o. of pairs (me 3 
| families | offspring Coes ei Mele of brothers | Correlation 
Intensity of purple colouration | 14 98 47 | 318 “52 
of corolla | 
Area of spots ... ee Seals lO 178 53 | 1168 | ‘61 
Revit WEEN aio a 19 170 oie) i) ai032 |) 228 
Length | 
Mean of correlations Ae -46 | — | 47 


The parental correlations given above differ somewhat from those given in 
an earlier part of the paper owing to the fact that the mean of the character, as 
it occurred in the six flowers, was taken instead of using the individual flowers 
of the plant. 
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The means of the coefficients of the three characters are -46 and -47 for 
the parental and fraternal correlation respectively. These results are not very 
divergent from those obtained by Prof. Pearson from very various sources; he 
gives the average parental correlation as about -48. 


V. Summary. 


The data at present available were derived from crossing in various ways 
10 cultivated foxglove plants with different characteristics, namely: crowned or 
peloric, not crowned, purple of different intensities, white, relatively wide and 
relatively narrow corolla, etc. There were 19 families; each family being raised 
from the seed of a single capsule. Altogether 188 offspring which flowered were 
obtained. } 


The nature of the inheritance of the following characters was studied: (1) 
peloric or crown character, (2) the colour of the spots, (3) the relative area covered 
by the spots, (4) the colour of the corolla and the degree of intensity of the purple 
colouration of the general surface, (5) the ratio of the width to the length of the 
corolla. 


The numerical results obtained are given below: 


Mendelian Inheritance Statistical Relationship 
(parent and offspring) (parent and offspring) 
| 
| a 
Observed a norouces Coefficient of Correlation 
xpectation 
Character = oe 
| | | Foxgloves Mean of 
Me| F co : coeffi- 
No. with | No. with | No. with | No. with |— eiente 
character | character | character | character Including Excluding poe 
present | absent present absent | seif-fertilized || selfetentilined canes 
families families sources 
= = _ — _ ~ || —_—— 
Crown ~ ave) Gassbl sce |e Soe meld 83 | 104 — 
Brown spots aes ata 23 69 23 69 = = 
Percentage area covered by = == poe aes | 43 32 
spots 
General purple colouration 41 | 139 49 131 “47 “41 
Ratio of breadth to length |  —- seen = — 34 30 
of corolla | | 
— —— ——— — — — s — —_ —— 
| | 41) ol ae od 
Totals <-, | S20) > <4.) GILT Osa 155 304 ! 
| 38 


(1) The peloric character may be regarded as arising by the suppression of 
certain internodes of the flower-axis and the concurrent fusion of the flower-buds 
resulting in a more or less regular cup or saucer-shaped flower at the top of the 
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axis, which matures before the ordinary zygomorphic flowers. This variety is 
known as monstrosa. 


When a peloric plant is crossed with a non-peloric plant the offspring are 
generally either completely peloric or completely non-peloric. There is no marked 
tendency for the offspring to be intermediate in character. Notwithstanding 
this fact the character is capable of dilution, since in the case of one family arising 
from such a cross nearly all of the offspring were more or less intermediate in nature. 
Certain of the internodes were much reduced, but not entirely suppressed, and the 
flowers involved arose in the form of petaloid blades which matured before the 
lower normal flowers, as in the case of a typical crowned plant. 


The gametic nature of the parents, as to whether they were pure or impure in 
the Mendelian sense with reference to the peloric character, could be judged either 
by raising families obtained by self-fertilization or by their action in the various 
crosses that were made. Families derived from self-fertilization from four parents 
were successfully raised, and in the case of the remaining parents the marked 
consistency in their behaviour in the different crosses indicated clearly as to whether 
they were homozygous or heterozygous. 


The non-peloric condition was dominant. 


On dividing the offspring into peloric plants (including those which showed 
any tendency towards a peloric condition) and non-peloric plants, the numbers 
were 83 peloric and 104 non-peloric. Calculating the Mendelian expectation 
for each family and adding the results the figures obtained were identically the 
same. 


(2) The results of reciprocal crosses appeared to be the same; that is the 
families obtained from 9 A x 3 B did not differ in any constant manner from those 
derived from 9 B x 3 A. 


(3) The results obtaimed by using pollen from the crown flower, the ordinary 
zygomorphic flower and of flowers with abnormally split open corolla did not appear 
to differ from one another in the least. 


(4) The colour of the seeds is very definitely related to the colour of the flowers 
of the seed-plant. Plants with white flowers had very pale buff-coloured seeds, 
those with hght purple corollas had light brown seeds, while those with dark purple 
corollas had very dark brown seeds. As might be expected, a plant with white 
flowers crossed with pollen from a plant with dark purple flowers produced pale 
coloured seeds, since the coloured testa of the seed is formed by the seed-plant and is 
not a result of fertilization. 


This observation emphasizes the fact that great caution is necessary in 
regarding characters as being independent of one another. It is often stated that 
an organism may be pure bred in one character and not in another; but probably 
this is true only in a very limited sense. De Vries holds that a change in any one 
character involves some change in the constitution of the organism as a whole. 
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(5) In plants in which the general internal and external surfaces of the corolla 
are white, the spots which occur may be purple or brown. Sometimes spots were 
nearly absent and the corolla appeared to be practically pure white, but when such 
was the case some sign of brown spots could be found on some few of the flowers. 
An obviously intermediate condition between purple spots and brown spots was 
rare, and judged by ordinary inspection the spots were either purple or brown, 
seldom brownish purple. The brown colouring matter may be regarded as a 
decomposition product of the purple anthocyanin. In plants in which the corolla 
was coloured purple over the general surface the spots were never entirely brown. 
Thus it would appear from the present observations that the causes which lead to 
the total decomposition of the anthocyanin in the purple spots of otherwise white 
flowers are never wholly effective in flowers with purple corolla. Accordingly 
although there is some independence between the two characters, the colouration 
of the spots and the colouration of the general surface of the corolla, yet never- 
theless there seems to be a certain relationship between the two, since the spots 
were never found to be completely brown unless the general surface of the corolla 
was white. A feature of unknown significance may be mentioned, the cells of 
the spots containing the brown colouring matter were found to be densely 
crowded with typical starch grains, while the cells of the purple spots containing 
the anthocyanin in solution in the cell-sap contained little or no starch. 


Purple spots were dominant and brown spots recessive. 


The observed results in the different families totalled 69 plants with purple 
spots and 23 with brown spots. The Mendelian expectation calculated for each 
family and added together yielded exactly the same figures. Since completely 
brown spots were never found in coloured flowers, only those families in which 
plants with white flowers occurred were included in this calculation. 


(6) With regard to the general colouration of the corolla it was found that 
some of the parents were heterozygous and others homozygous. The results of 
the crossings were distinctly Mendelian in character. Purple was dominant and 
white recessive. There were 41 white and 139 purple offspring; the theoretical 
expectation, calculated for each family and added together, was 49 white and 
131 purple. 


A point of considerable interest was noticed, namely that im dealing with 
parents and offspring of similar gametic constitution (i.e. heterozygous and 
homozygous dominant) the inheritance of the intensity of colouration followed 
the usual statistical laws. A light purple plant crossed with a dark purple plant 
might give some perfectly white offspring, but those offspring that were coloured, 
if of the same gametic character as the parent, tended to be intermediate in shade 
between the two parents. Thus the intensity of colouration is not a mere fluc- 
tuating uninheritable character, but its inheritance follows the usual laws found 
by statisticians to hold in the case of characters where a Mendelian relationship 
cannot be detected. The parental coefficient of correlation. between heterozygous 
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dominant parents and heterozygous dominant offspring including self-fertilized 
families was -47 for male parent and -41 for female parent. These figures are not 
far removed from Prof. Pearson’s mean coefficient -45. 


(7) The remaining two characters dealt with, viz. the percentage area covered 
by the spots and the relative width of the corolla, exhibited no discontinuity in their 
inheritance, and consequently a Mendelian relationship could not be detected. 
The parental coefficients of correlation for the spotting, including and excluding the 
self-fertilized families, were -43 and -32 respectively, and for the width °34 and -30. 
The mean of all four coefficients is -35 which has to be compared with Prof. Karl 
Pearson’s mean coefficient of -45 derived from various sources. Owing to the 
fewness in the number of parents the probable errors of the foxglove coefficients 
of correlation are large, but nevertheless there appears to be a distinct tendency 
for the parental correlation to be low. Similarly Prof. Karl Pearson found parental 
inheritance to be low in the Shirley Poppy: the mean value being -33. 


(8) The mean of the seven mid-parental correlations, calculated for the crown 
character, the spots, the general colouration and width of the corolla, is +57 and 
the mean of the six parental correlations is -38. 


(9) The experiment at the present stage cannot throw any direct light on the 
pure-line theory. The variability which occurred among cross-fertilized families 
amounted to 67 % of the variability of the race, and among several self-fertilized 
families it was 50%. With such a high variability it would appear to be unlikely 
that the families, raised by self-fertilizing the two brethren at the two ends of the 
range of variation of the families, would exhibit the same mean. 


(10) The resemblance of the individuals of a family to one another with 
reference to several characters was calculated, and the mean of these fraternal 
correlations was found to be -47. Prof. Karl Pearson gives a mean value of -5 
or a little more for the fraternal correlation of a series of diverse characters. 


VI. Theoretical Conclusions. 


We thus see that among the characters examined we find typical Mendelian 
and typical non-Mendelian results, and the relationship between the two types 
of inheritance is by no means clear. It has been said that Mendelian inheritance 
occurs in the case of characters which do not mix, but this view is not warranted 
by the present results. For example, the peloric character can be diluted by crossing 
with the ordinary type, but such dilution does not usually take place; also, the 
general purple colouration of the corolla is diluted when a dark parent is crossed 
with a pale parent; but, nevertheless, a very pale general colouration is exceedingly 
rare, and the flower is as a rule either distinctly purple (although perhaps rather 
pale) or white. Thus we have discontinuity even when characters are capable 
of dilution. 


From the experimental data which have been accumulated during the last 
decade there is no doubt that segregation of characters may occur in crossing 
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hybrids, and by defining the unit-character in a suitable manner the familiar 
ratios so well known to Mendelians do appear in the case of certain characters. 
The above expression “defining the unit-character in a suitable manner” is well 
illustrated in the case of the general colouration of the foxglove flowers. Provided 
we define a “purple flower” as meaning a flower in which the corolla is more or 
less uniformly tinted all over we obtain a simple Mendelian relationship; but 
many of the white flowers, on being closely scrutinized, were seen to possess an 
excessively faint tinge of purple over small isolated areas, the remainder of the 
corolla being dead white. If these plants had been included under the category 
of “purple flowers” no Mendelian relationship would be apparent. There is, 
however, a certain justification for excluding such from the category of “purple 
flowers,” since there is a distinct gap in the series between the flowers which are 
tinted faintly purple all over and the white flowers with an extremely slight 
blush of purple over small irregularly placed areas. 


A similar relation occurs in connection with the brown spots. Provided the 
spots are examined with the naked eye they are very generally either brown or 
purple; brownish purple spots were seldom observed, and a simple Mendelian 
relationship occurs, brown spots being recessive. If, however, the purple spots 
are examined under the microscope the distinction between the two kinds of spots 
is seen to be not so sharp, since a certain amount of decomposition product may 
be present, although not as a general rule in sufficient quantity to give a spot 
a brownish purple appearance to the naked eye. Here, again, we have a distinct 
gap in the series; it occurs between a small amount of decomposition and total 
decomposition. <A strict Mendelian might hold that in the first case the slight 
tinge of purple occurring in the white flowers was a character distinct from the 
purple colouration of the general surface of the corolla, and was controlled by a 
different determinant or group of determinants; but such a supposition could 
scarcely be applied to explain the occurrence of the gap between the small amount 
of decomposition and the total decomposition of the anthocyanin of the spots. 
By presupposing the existence of many factors controlling a character and the 
linkage of factors it may be possible to express in Mendelian terms almost any 
experimental result that may be obtained, but it must be confessed that there 
is an artificiality about the process which does not inspire confidence in the 
interpretation. 

With regard to the peloric character, in order to obtain a simple Mendelian 
relationship, it is necessary to include in the peloric category all plants which 
exhibited any distinct tendency towards the peloric condition. There is a gap in 
the series between flower-axes which exhibit some shortening of the internodes 
of the upper flowers and a marked shortening resulting in a distinct abnormality ; 
and it is this gap in the series which gives the clue as to where the line of division 
should be drawn between peloric plants and non-peloric¢ plants. 


The question arises as to how far a so-called unit-character is capable of change. 
In the course of evolution any given character must of necessity have arisen in some 
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way from some other character, and according to de Vries it arose by a sudden 
mutation. It is clear, however, that the somatic nature of an individual tends to 
reflect its gametic constitution, and obversely any change in the somatic nature 
of an individual tends to imply a change in the determinants of the germ-cells; 
and therefore the factor or determinant, say, of the general colouration of the 
corolla in a pale purple heterozygous dominant foxglove differs to some extent 
from that in a darker purple heterozygous plant: and the offspring of these two 
plants, if crossed with the same pollen, and bred under the same environment 
(in its widest sense) would not be the same. I have deduced some evidence in this 
investigation to show that the relationship of parents and offspring with respect 
to the fluctuations in the intensity of expression of a Mendelian character is similar 
to that of a non-Mendelian character provided the parents and offspring are of 
the same class, i.e. all heterozygous or all homozygous. It follows from this that 
a slight variation in the nature of the factor in the parent is accompanied by 
a slight somatic variation in the offspring and will be subjected to natural 
selection. 


It appears to be premature to attempt to visualize these complicated pheno- 
mena in the nuclear behaviour of the sexual cells, although Morgan and others* 
have made ingenious attempts at so doing. In any such attempt it seems necessary 
to assume the existence of factors, determinants, plasms or something, no matter 
what it is called, which bears the heredity influence. Speaking in these mechanical 
terms it may be suggested that characters which exhibit no discontinuity in their 
inheritance, i.e. show no Mendelian relationships, are transmitted by a determinant 
or a system of determinants uniformly distributed through the chromosomes, 
and perhaps also through the cytoplasm of the germinal cells; while those 
characters which do exhibit Mendelian relationships depend on determinants 
which are localized in the chromosomes only, and possibly in definite places in 
the chromosomes. Morgan has recently referred to this last conception as the 
linear arrangement of factors in the chromosome. 


In both cases, whether localized or not, the determinants controlling a particular 
character are variable in nature and are therefore subject to gradual modification 
by the natural selection of the individuals which bear the genital cells. 


Mendelian inheritance in its original simple form, as it sometimes appears in 
the crossing of varieties, would seem to be, so to speak, an incidental phenomenon 
arising through the nuclear changes and the localization of the determinants, and 
it is open to question whether it has had any marked influence in the evolution 
of species. It is conceivable that evolution has taken place in spite of, rather 
than by the aid of, such Mendelian inheritance. The hereditary influence of the 
parent varies slightly through changes from unknown causes in the nature of the 
determinants, and the small resulting somatic variations in the offspring would 
appear to constitute the chief means by which evolution may slowly proceed 
and adaptation to the environment can be effected. 


* Morgan, ‘I’, H. and others, The Mechanism of Mendelian Heredity, New York, 1915. 
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(1) Introductory. In a paper of 1908* “Student” dealt experimentally with 
the distribution of the correlation coefficient of small samples, and gave empirical 
curves—in particular for the case of zero correlation in the sampled population 
—which have proved remarkably exact. The problem was next considered in 
1913 by H. E. Sopery who obtained the mean correlation and the standard 
deviation of the distribution of correlations to second approximations. Of the 

* Biometrika, Vol. vi. p. 302 et seq. 
+ Biometrika, Vol. 1x. p. 91 et seq. 
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formulae he gives for 7 and o, of the distribution of the correlation 7 in samples 
of n from a population of correlation p, we have found in practice the most exact 


are* 
pee meee isiyn2) = (4. = 42) (1 4 3p%) (i) 
r Pp) on p Sn? p a New istceion sates renee’ 
= BAP Ran 2 4 
and Gas ati) (1 Dat P| RE MMCLiS NR mc nr aes Mea rain aenmeeee (ii). 
nN nN 


Soper also by assuming a Pearson curve of limited range + 1 to — 1 of type 


L m 1 L m 2 
rs yo(1 =F) @ oe) 


deduces the modal value * of 7 as approximately 
_* (62+ FP) . 
YQ X u z eee eer ens 
Saar eae agit ray (111), 
so that * would be determined from a knowledge of 7 and o,? + T°. 


The next step was taken by R. A. Fisher who gave in 19157 the actual frequency 
distribution of 7, namely the curve 
n—-1 
== n—4 
(T= p2)i 2 Ree dt GOs" (— pr)\ 
pai ar) 2 | 
a(n — 8)! d (rp) V1 — p*r? 
Except for very low values of n this expression for y,, does not provide a formula 
from which the ordinates of the frequency curve for 7 can be readily determined, 
and as the problem was left by Fisher there were no rapid means of numerically 
determining either 7 or * or again o,”. 


Clearly in order to determine the approach to Soper’s approximations, and 
ultimately to the normal curve as ” increases we require expressions for the moment 
coefficients of (iv), and further for practical purposes we require to table the 
ordinates of (iv) in the region for which n is too small for Soper’s formulae to 
provide adequate approximations. These are the aims of the present paper. It 
is only fair to state that’ the arithmetic involved has been of the most strenuous 
kind and has needed months of hard work on the part of the computers engaged f. 
On the other hand the algebra has often been of a most interesting and suggestive 
character. 


(2) On Properties of the Function U = cos-! (— «)/V/1 — 2°. 
dU i, © cos} (— 2) 


We have oe a SS z 
dx: 1— 2# (1 ey x2)? 
dU 
or (al A 1 SE AO 
* See loc. cit. pp. 105 and 107. + Biometrika, Vol. x. p. 507 et seq. 


{ Besides those whose names are given under the title, we have to thank I. Horwitz for some 
calculating aid, Ethel M. Elderton and D. Heron for occasional assistance, especially in the experimental 
part of the work, and lastiy but very far from least we have to acknowledge the untiring work of 
H. Gertrude Jones and Adelaide G. Davin in the construction of the models the beauty and accuracy 
of which are not more than suggested in the plates. 
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Apply Leibnitz’s Theorem and we have 


anu Cheer Of (n — 1) (n — 2)d"-2U ies 1U Gey) 
Sep Jo) eee On Ni Saemee) a Z -a)\ ae 
a a A A OO 
Be OE on Bear dU Pee, 
or, (1 — x?) Fae a! (2n Fe n AR Oe (v) 


Put x = 0 and we have 


EON, nee es 
(a) (n = 1) Gaal , 
but clearly Uy = $m and (dU/dz), = 


Hence by Maclaurin’s Theorem 


cos*(— a) a 12 Bae (2s — 1)? (2s — 3)? 2. 1? 
VT? Al ray hy Saas (2s)! a: ) 
2? 42 92 (2s)? (2s — 2)? ... 22 ) 
7) | n/m me. p2stl1 
+(2+ 5,0 fe ae (2s +1)! - 3 “s(W) 


We are now in a position to give the successive differentials of U which may be 
either even or odd. We have for the two cases 


ds (cos—t (— 2 


dx® | /1 — a 
2 2 
Dios 18 (2s — 3°... 18 fr 4 I ul py ee ae 4 


+ (28)? (2s — 2)2... 22 {e gp AS aie) es eSaiae ee at oof li), 


3! 5 


‘ 2 ‘ 2 2 2 
= = (2s — 1)? (2s — 3)?... 1? jor AF z peas ae Ssisd) ee + 
2 Qe)? (Js + 9)2 Nt 
4 (Qs — 2)? (2s — 4)? ... 2 145) ne) or xt 4 of nV) 


the development of the several series being clear. 


For calculation of y2, Or Ys, the above series are idle, just as they are when 
substituted in the equation for dy,/dr = 0 which gives * They converge far too 
slowly to be of use for numerical evaluations. But as we shortly shall show, they 
are, after certain transformations, most valuable in determining the moment 
coefficients. 


[pene 2U 
Now Yn = fc oF qd ah 22 Ta n—2 ) 


y 


n+1 n- 2 
multiply (v) by (1 — p*) 2 (1 — 72) 2 ie (n — 1)!), 
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and we have 


‘(Qn —1)V1—p?V/1 — 7? (1—p2)(1 — 7?) 
5272 pr ( ia p p oye 
( p r )Yn+2 n—1 n+1 (n fer 1) (n —2) (n 1) Yn 0, 
an — I n—1 ee 
thus Ynsa= 7 KY net + 1 9 an tet t tte (vill). 
prV1—pPvV1—r? alee a) 
Here (ey == l Ss per r) LS) Sa 1 = per 


are constant for p and r given and thus (vill) enables us to deduce y,,,9 from Yy44 
and y, fora givenpandr. But by simple differentiation 


ne i oa da ( 1 : pr cos? {= pr) \ 
ee of ee Memos 1) Deane nur (ix). 
Leon aa 


Hence if y; and y, be calculated for a series of values of 7 and p all higher values 
may be reached by a repeated use of (viii). The values chosen were: p proceeding 
by -1 from 0 to 1 and r proceeding by -05 from — 1 to + 1. 


The disadvantage of this method of calculating y, is that, except by inde- 
pendent computing, there is no means of checking accuracy until all the ordinates 
have been deduced, and any mistake in y, for a low value of n is perpetuated 
throughout the series. When all the ordinates have been found, say for n = 25, 
then the smoothness of these ordinates and the fact that they give the correct 
total area with a suitable graduation-formula will be checks on the accuracy of 
the whole system of ordinates. In this manner Table A, p. 379, was calculated. 


Another method of approaching the value of y, is of some advantage. We 
may take 
n-1 P 
es) z 2 > OF Uy, cos! (— pr) 
In (hw — 3)! ss a) ( Se ee sur ace (x), 


(1 be p2r2)r— on =o) 


(ie ore)? 
where v, and u, are functions of pr in integer positive powers, and if we substitute 
in (viii) we obtain 
Unt2 = (2n im, 1) prUnty aly (n aa, Le (1 ay p>1?) Un) (xi) 
Le pee ne 13(l tra eee 


We may write y, in the form 


C= OF apr ANT pir)” 


V1 — p? 0 _ cos™! (— pr) 
es te= 02rd — 9) (Ta paae ( nah) 


di cosa (= aa 
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Hence v, = 0 and uy = 1, while y, will vanish for all values of r, except r= + 1 
owing to the factor (n — 2),-2. Thus (ix) gives us 


0,= 1, Ug = pr, 


whence by (xi) 


VU, = 3pr, U, = 1 +. 2p? 7?, 
V5 = 4+ I1p?r?, Us = pr (9 + 6p?r?), 
V_ = pr (55 + 50p?r?), Ug = 9+ T2p? 7? + 240474, 


and the successive values can be rapidly calculated, much faster than by actually 
differentiating out (iv). It is, however, shortest to insert the numerical values 
of p and r in (xi) and deduce the v,’s and w,’s numerically in succession. (Table A 
was, however, in the present case deduced from (viii). We did this by direct 
calculation of the values of 


Yn V/1 — p? cos} (— pr) 


Goole a1) a — parad’ 


and ys in equation (ix). Equation (viii) then gave us the numerical values of 
Ya» Ys, etc. in succession.) 


We may write (x) in the form 


jam PE 1, FO) (xii), 
Bi 
: OR eke ep cooad at 
where Vn = (n = 3)! (1 = p22" and aaa Vl pit 


Here V,,, Y,, U, and U are symmetrical in p and r and accordingly p and r can be 
interchanged. The problem approached this way involves: 


(a) calculating (1 — p?) 2/ar for various values of p; 
(b) JU for various values of pr; 
(c) V,, for various values of p, r and n; 


(d) determining wu, and v, in succession from (xi) for various values of pr 
and n. 


Lastly we may use the series for y, to be given later (see Eqn. (xli1)) which 
develops yn In inverse powers of (n—1). Actually we have adopted (vii) for 
tabling the ordinates of the first 25 curve-series, and the last expansion for 
verification and higher cases. 


(3) On the Determination of the Moment Coefficients. We shall next determine 
the value of the moment coefficients about 7 = 0, as origin, and shall deal with 
the even and odd coefficients independently. Let them be p’,, and p'sy44- 
Clearly, the total area having been taken as unity: 
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2? — 72? — (72 — 1)? + (— 1)? (1 — 9°)? 


= prep-2 a Pp ie 1) y2p—4 1 p ( y ag “te ax 2) y2p—-6 as {= 1) (1 jens 72)P, 
Hence 
, , —1) , —] —2), bie 
MK ap = PH 29-2 — ale Le 2p—4 Pe Ma page ent (= LP Xo, (XIN), 
n—l 
roe n—-4+4+2p 
je (ES Pl. ei ear wes 
where Nari = Geant (1 — r?) d(pry™= dr. 


Thus using (vii)is on the assumption that n is odd and remembering that odd 
powers of r will now disappear we reach: 


n—-1 
(1 — p?) 2 ee te ete te a. 
Me = ag ayy (8) (n—8)t...28 (ip + MS ptt tit...) 
“44 n—-4+2p 
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Now we may write r = cos d, so that 


rT 


2 
dom = 2 iF sin" +2?-3 4 cos?” d dd, 


: 2m —1 ; : 
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issip?) 2 2 (. 2 2; Gy ee 
Xa0 = oy at (% — BFW 5) 1a. 28H (1+ A eee 
CO DM Ge) Sere oe ; 
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where F as usual denotes the hypergeometrical series. But by a well-known 
transformation due to Euler 


See Ha oneeoey ae) (lec) mes (rary — 1B, 474.2) ine esas. (xv), 
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and accordingly 


n-1l1 n—-1 n—I1 p 
FAS. tS pote w)=a-09 


ped 


Tye n—1 
= F(p, p, "S—+p, p*) 


4 “a 


> ilies oaeee te i n—1 
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Now ie fr sin" t2?=3 hdd — 29), .en-9) ‘SAY, 
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/ 0 
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if n be odd as supposed above. Thus finally we have 
briny if n— | as 
Xop = (1 — p?)? 1 ae °F [Ps Ps —y— + P, p°) eee (xvii). 


T 


A Table of g, = iF sin’! ddd from n = 1 to n = 105 is given on p. 377 below. 
/ 0 


Now (xvi) has only been proved for n odd. If n be even we must take the 
first series of (vii) and this gives 


n-1 


2) 2 CL ie. oe Du fane 2 yl = Dee i 
Xo = 7 (n— 3)! 2 (n 3) (n 5) Sonidud 1 lo Il 1 21 Pp wie 2p — 1 
CEN Ora A 1 = oan \ 
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since nis even. Thus 
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or (xvii) holds whether » be even or odd. 
As particular cases we have for p = 1 
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The series in (xx) and (xxi) for n = 25 and upwards converge with sufficient 
rapidity to determine jp.’ and yp,’ rapidly and therefore with accurate values for 
py’ and p,’ will give p or o,” and p,, and thus provide the determination of B,. 


We will now determine p’,,;; in like manner. We have 


but 
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_ i : d?-1U 
Assuming » odd we must keep the first series in the value of ,— 


d (fo) we 
reach 
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if we use Kuler’s reduction formula, and note that for n odd, or n — 1 even 


3 (n — 3) (nm —5)...2 
em ea Sel 


If we start with even we reach an absolutely identical formula by a different 
route. Thus we have 


, ie 
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Taking p in succession equal to zero and to unity we find 
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Equations (xxv) and (xxvi) provide the values of the odd moment coefficients 
about zero and this in fairly rapidly converging series. From them we can deduce 
the value about the mean ps; and thus find the fundamental f,. Table X, p. 377, 
again gives the requisite values of q, for the range n= 1 to 105. 
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Illustration. Samples of 25 are taken out of a population in which two variates 
have the correlation p = -6. Determination of the nature of the distribution of r 
in these samples. 

Here n = 25, and with p =-6 we find from (xx), (xxi), (xxv) and (xxvi) the 


values * 


wy! = F = 591,825, ig’ = °368,739, 

jig = -238,298, tug’ = *158,510. 
Further lta = 018,482, o, = 135,950, 

us = — -001,812,380, ju, = 001,279,141, 
giving B, = -520,265, By = 3°744,573. 


The distribution is thus very far from normal. 


Hence by the formula t : 

ae VB (Bs ak 3) eee eer eee 
2 (5B — 6B, — 9) 

we find * — 7 = -050,094, 


Distance from mean to mode = 


We shall see later that the actual value is 
* = -64194, 
or the approximation is very close. 
The skewness is given by 
Sk. = (# — P)/o, = -36847, 
thus indicating that there is but little approach to normality. 


Fig. 1, p. 338, shows the excellent fit of a Pearson curve of Type II to the dis- 
tribution. The equation is 
x 5+ 7536 v 178-5135 
g= 3100 (1— sro) (1 +ggam) 
We see that when n = 25, Pearson’s curves—fitted by moments not by range— 
adequately describe the frequencies, but there is still no real approach to a 
Gaussian distribution. 


The series-expansions which have been given for the determination of the 
moments are of very little service when v is less than 25. We have therefore to 
consider formulae for deducing in succession the moments about r = 0 for n= 5 
to n = 25. 


* The values were in every case worked out to nine places of decimals. 
y Pearson: Mathematical Contributions to the Theory of Evolution, xu, p. 7. Drapers’ Company 
Research Memoirs, Biometric Series, Cambridge University Press. 
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By (xxv) 
z des | pase 0 a3 32 (2s — LR ps rh \, 
iceeat ree m+32 °° ° sg! (n+ 3) (n+ 5) — (n+ 28 + 1) 28 
i In 12 p? 12.32... (28 — 1)? p> ) 
= 1 12 pa Se 
es oral 2 sl(w+1)(v+3)..(v+2s—1l a ")” 
a In+2 n i oe 
ee hoes In-1 fe 1 (eae: 2 $s 
ye ae Cee a 
s!(n+3)(n+5)...(m+28+1) 28 ° =) 
net ie 2 ps eee eee LE ioe 
n n+12 s!(n+1) (n+ 3)... (w+ 2s —1) 28° JS? 
since Grete Giles 
Yn+1 ii) ee Mk 
The general term is therefore 
tl dna — 2 (wd (2 1)) 
(n—1)(n—2) Pq. mn \n+2s+1 


We 3? (2s — I)? je 
s!(n+1)(n+4+ 8)... (m+ 2s — 1) 28 


Now in (xxvi) we have seen that 


“ Ne Inia W — 2 p 
Tn — Pan p(l p”) nce {1 a iaseo tee 


oy kate 3 .. (28 + 1)? ee | ! 
sl (m+ 8) (n+ 5)... (m+ 2841) 22° °°}? 


and the general term is 


dasa = 2 ( 88.5... Qs 
Gna” \8!(m+3)(n+5).. 


--+ 
=e wt 3. 52 . (28 = ie oe po 
(s— 1)! (n+ 3)(n ane (n+ 2s — 1) 28 
eon De see Sl) p? 
aF Qs 


= ae 2 (2s + 1)? 28) 
ees nm \n +241 — J st(n+3)(n +5)... (n+ 2s—1) 
eee 2 tos @ ot) (18.8858... Os — 1)? p® 
pe 0s: n nA 934 1 Greil) (3) (Eb Os = 1) 2 
WOH 
Ynse  — 2 (= sia) (n? — 1)) 
n-1 1 n+2s+] 
12.32.52... (2s — 1)? pes 


ICE CBE CE ee Ce ESS 
= (n — 1) (n — 2) (Tne — Tn). 


Z - T,— ae 
Hence (ae bs i a DG ens a ee (xxvill). 
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This result expresses the mean for samples of n + 2 in terms of the mean for 
samples of n and the third moment of samples of n. 


Next let 
n-1 
noon +1 n-4+ 2p n—2 
tenn = TP amy 2 OO, 
TCs ee d (pr)"-? 
n—l 
ghee 
— a (n— 8)! Fan 
Now integrate f,, , by parts twice : 
n—-6+ 2p 
Bee a) Pee a ore, odes 
Ine =p, = aes | d (1 ae ) d (pr)”- yrs OF 
i re ne —-8+ 2p 
=" ne ae {(n—6+2p)(1—1%) 2 
n-6+ 2p 
ean ae San, Fe Get OF : 
(W160 4 Ape Le n2)} * T(prye 
ae) 
= ee HOD) fsa da 100 6 + pees 


Or returning to the xoy,, notation 


1 a= op 4 tee 4) S 
; : aa 3) (n — {(m — 6 + 29) Xep-2,n-2 — ( — 6 + 2p + 1) Hop.) 


X20,n = 


As special cases put p = 1 and 2, and change n ton + 2. We have* 


l—p n n—1 F 
X2,n+2 = p2 Ses {Kon = eo) Xan} elofarsteredurercveletetetel opt tetarets (Geb), 
1—p? n+2 n n+ 1 ) 
Xa, 142 = — p == n = il . | By n BY Xa mi eee eect cece cee (xxx) 
But Yon Sand ayo, —= ea, 
Xan = ie 21's, n ata [aon 
Accordingly 
J 1 ed : ) (XxXx1) 
- a,n+2 = = pp ne) Ham — 5, dyevatetaveretererstlenslereberet 5 


which can be verified directly from (xx) or (xx)bis. Again instead of working 


with the series for y,,,,;. above (xxx), we can replace it by one involving the 
moments about 7 = 0, directly: 


: 1 eye eee CS) ee ee es --4 
eae csr C7) al 


* The process of integrating by parts shows that we must have n> 2. 
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The diagram referred to on p. 341 will appear with Part II 
of the paper. 
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It remains to determine the formula for p’s,,,2. We have 
n—1 


mon Sf 2p 
1 yee +1 pees fr 
X20+1,n = ( a) p’) | (1 — 7?) a r oe dr. 
aw(n— 3)! J 4 d (rp)"—2 


whence by double integration by parts to reduce the U differential coefficient we 
obtain 


Sip fp, ‘ n—4+2p+1 
X20+1,n = 2 (n — 3) (n— 4) i = 0:20) Noy -aee ray Xep+1,n—-2/ : 


Putting p = 1 and changing n to n + 2 we have 


I—p*? n a+1 ee 
V3 nie = ee ead Xtin — ) 9 Xam (verses teens Coxe 


This may again be read as a formula for p's ,42: 


aw Ls p> Tn — L’s.n Sh ee , 
lee “i — Dim —2) (1+ m(m+ Dae) ee) 


L'3,n+2 = Ty (1 ss 


Starting with the values of the p’s for n= 3, 4, 25 and 26, the moment co- 
efficients about r= 0 have been determined for n = 5 to 25 in succession. As 
- controls the values for n = 20 had already been determined and those for n = 10 
were also obtained at a very considerable expenditure of labour from the very 
slowly converging series of Formulae (xx), (xxi), (xxv) and (xxvi). The initial 
values of the moment coefficients (i.e. those for n = 3, 4, 25 and 26) had to be 
calculated generally to 15 and sometimes to 20 significant figures, owing to the 
numerical factors in (xxviii), (xxxi), (xxxii) and (xxxiv) being frequently greater 
than unity, and thus errors in the last figure being repeatedly multiplied. According 
to the special value of p, it was found best sometimes to deduce moment co- 
efficients of n + 2 from those for n, and sometimes those of n from those for n + 2, 
i.e. to work up from 3 and 4, or down from 25 and 26. It seems unnecessary to 
enter at length here into the many difficulties that arose in the course of these 
calculations. We think they have all been successfully surmounted and that our 
final values may be trusted to the figures actually recorded in the tables. We thus 
found the moment coefficients and from them the values of 6, and f, for the ten 
values of p from 0 to -9, and for the values of n, 2 to 25, 50, 100 and 400. 
Diagram I shows that our 270 frequency curves are fairly well distributed over the 
most frequently occurring portion of the B,, 8, plane. Now our view is that the 
constants 8,, B, describe adequately for statistical purposes the bulk of the usual 
frequencies distributions. But we have provided tables of the values of the 
ordinates for the above 270 curves. Hence by interpolation it will now be possible 
to determine rapidly ordinates which will graduate with reasonable accuracy any 
frequency distribution whatever quite apart from the idea of sampling normally 
correlated variates*. 


* Francis Galton frequently insisted on the importance of forming Tables of frequency ordinates, 
which would graduate any frequency distribution in the ,, B2 plane. A scheme for covering this plane 
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In order to make use of our ordinates for graduating frequency curves we must 
express the distance from our origin to our mean (i.e. from r= 0 to r=7,) In 
terms of the standard deviation, and further the unit of argument of the abscissae, 
i.e. 05 in 7, also in terms of the standard deviation. Our interpolated frequency 
ordinates (reduced of course, to the size of the actual population) will then have 
to be plotted to intervals of -050,/c,, the origin being 7,,0,/o, from the mean of 
the graduated data, where o, is the standard deviation of the graduated data. 
Care must be taken to so choose the axis of abscissae of the graduated data that 
the sign of 4, is the same in the graduated material and the graduating frequencies. 
Table C gives the distance from the mean to the origin of coordinates in each 
case and also the abscissal unit for plotting both in terms of the standard deviation. 


(4) On the Determination of the Mode. Differentiating (iv) we have 


n-1 
n—-4 n 


dy, (lL — yee | Lee gs A ey ae =g7 ew 

PIC are tara) aay 

Hence the mode * is given by 

v1 d.-2U7 
i, = Wy iy oe 
. Oe We eye (3 ore!) Ars (rp)r— (n — 4), 
or writing p? == 7p, we have ; : 
2 54) OES (n — 4) p2 dn U 
(p p d (p2)"-2 = P d (p2)\r-2 eiaje'ase\e leloleieterehetets (0:0:0:0'0) 


where U is U with p put for 7p. 

Now (xxxv) is by no means easy to solve adequately, for if we solve it by 

approximation, 7 =p and #=p?* is not sufficiently close for an effective first 

approximation, especially when p differs considerably from zero. We have indeed 
from (v) the relation 

n-1]) PU n—3 [ 

(1 — p*) a p® (2n — 3) d (p?)r-? (n — 2)? ee 


= 0 (xxvii 


and this might be combined with (xxxv) to deduce in succession relations between 

lower pairs of differential coefficients, till we ultimately reach a relation between 

dU/dp? and U, but the process is too laborious except for very low values of n. 
Fisher has outlined another method of approaching the mode*. It is easy to 


see that 


cos—! (— 2) 2 ( ., 1-2 oar 
= ———a SS ———— t 1. i _ fF Ye 
Vi-2@ V1—# Fe A/tu ee ee as) 
2 £—@5\ 2 
ie {==} 
val SS ; 
<5 es oie 
—  So(E—a)?+1—a% ~ Jo €*— 2€ +1’ 


with a series of Pearson-curves has been long under consideration, but the immense labour of calculating 
the ordinates of 400 to 500 curves has so far prevented the actualisation of this idea. The present 
ordinate-tables go some way to supply the need Galton pointed out. 

* Biometrika, Vol. x. p. 520. 
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2 dz = dz : 
1 = e7* — — = = f = pr. 
Diy SS i. cosh z — i cosh z — pr’ gee 
n-1 
om -4 
(1 — p?) 2 , a 7 a (cae Ea) 
ee ine 8 ary tap 
n= ' 
_( ahs) 2 (1 ye Cae de — 
a(n — 3)! d (pr)”~? !9 cosh z — pr 
n—-1 
ou v—4 
(1—p?) ? a is dz fe 
= 2 1—y7?) 2 — pa (XX), 
(n — 2) 3 (i= 78) lo GH ae (xxxvil) 
n—-I 
mar _ ay 2 n—4 
pace 2) = p*) (lpr OL a Saye netaeren<e (xxxvii)Pis, 
Substituting in Hqn. (viii) we find 
m (1 — p?r?) Ina = (2n — 1) prl, + (w— 1M) Iyy «..-. (xxxvill), 
as the reduction formula for the J,,’s. 
faite : ; thas dz 
Sunilarly, if I = — ih (cosh 2 — py2)"72 A 
then — Mi(li=o5-) La — (20 — lips lh, + — 1) Loy ..-(xxxvin) os, 
Now using value (xxxvii)Pis for y,, the equation for the mode is 
(n—4)72,,=p(1—*)(n-I) ft, 
or, if as before, 62 = pv, we have: 
(= ANip2 I =p? = o8) = DL) Tgtaedmnks. (xxxix). 


This combined with 


ees el On 1) p2l, 4 — Ld oe Se es senaks (xl), 
should determine the mode. 


Now assume p? = py? + €, where py? is some first approximation to p?, then we 


find 
(1 — 4) po?E'n-1 — ( — 1) (p? — pot) In’ 


Cha 7 G 7 9 a 
(n — 4), + (%— 1) (n — 2) poly’ — (W— 1) (p? — po!) ns 
Sagan (xh). 
If we had obtained an approximation p,? to 6”, we could start with 
at fe 2 2 —l (—_ 2 
gas EE i a 
LE 1— po (1 — po’)? 


and by aid of (xxxviii)bis determine the /’’s in succession. If #, =TJ,,/I,_, we can 
put our results in the forms (xlii1) and (xliv) below, and calculate successive E's: 
(n — 4) po? 
[= Bee! _ (nm — 1) (p* — pat) 
LSS SS eee ( X11); 
E, + (n — 1) (n — 2) po? — 2 (nm — 1) (p? — po’) Ents 


344 Distribution of Correlation Coefficient in Small Samples 
n—2 
De re) 


But even this would be laborious had we to find successive values of H,, from 
(xliv). Actually, if » be moderately large, H, and H,_, tend to equality fairly 
rapidly. For example the following are the values of EZ, for py = -6: 


I, = 2:078,4173, I, = 1-873,8688 and therefore #, = -901,5845. 


and (m — 1) (1 — pot) £, = (2m — 3) po? + 


E, -901,5845 | Ey 1-470,8511 Hive 1-511,0031 | Eos 1-527,2571 
E, 1:257,5588 | Ey, 1-475,5703 E,, 1-514,1874 | Ey, | 1-528,7778 
H, | 1-183,5106 Ey, 1-486,5966 Ey, 1-516,9985 Hy, | 1-530,1728 
E; 1-451,8703 Ey 1-492,1848 Ex 1-519,5018 Eng 1-531,4570 
EL, 1-377,5430 | Hy, 1-498,5199 HHIBy 1-521,7436 EK’ 1-529,7263 _ 
E, | 1-453,2879 Ey, 1-503, 1022 E,, 1-523,7636 EY” —_1-531,0459 
E, | 1-445,7342 Es 1:507,3770 | Eos 1-525,5928 | jv” 1-530,1488 
| | 


Clearly £, and E#,_, approach equality. Now put H,;= #,, for n= 25 in 
(xliv) and we have for py = °6 
20-8896H”? — 16-92#’ — 23 = 0, 
which gives for the root required 
E’ = 1-529,7263. 
But we might also have made Hy, = Hy, and so reached 
21-7600H'? — 17-64H” — 24=0, 
which gives kh” = 1-531,0459. 
It is better therefore in finding #,, to equate H,, and #,,_, than £, and EL,,,. 
A stili closer approximation may be found by noting that 
E, — EH’ =¢« = E,4, — E”, nearly, 
where ¢ is very small. Hence since 
n (1 — po*) Enis tin — (2n — 1) po*#, — (n — 1) = 0, 


we have 
(n — 1) + (2n — 1) pp? B’ — n (1 — pot) BE” 
eS 7 7 eye ne. 
n (l= pol) E+ B”) — Qn 1) py! 
i aa oad 4 '2 
or, Bi? = E’ by (n 1) me (1 Po ) E (xlv) 


Awe p+ BOS One 


For the case of py = -6 and n = 25 we find 
E'” = 1-530,1488, 
and #,, — H’”’ = -000,0240, a close agreement. As a matter of fact as we only use 
£, in a small term the approximation H’ is generally quite sufficient. 


In the above method all turns on finding a good value of p,?, i.e. a first 
approximation to the value of the product of p and *. This may be obtained in 
either of the following ways: 
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First, choose the values of p,? and E’ to satisfy the simultaneous equations 


n — 4) po? 
i Pot _ (ns — 1) (0? — put) = 0, 
a S) 
and (n — 1) (1 — po!) B’ = (Qn — 3) ph + "=", 


Or, we have for py the equation 
— pm eee 4 2 a 2, pe eee 4 
: ree 1) = Ps TOL Ei ce ‘Doe au 
which writing p9* = z gives us 
(n — 4)8 (1 — 2) 2 = (2n — 3) (n — 4) 2 (p22) + (n— 2) (n — 1) (p? — 22, 
or 62? — z {(n — 4)? + p? (Sn — 8)} + (n — 2) (n — 1) p*=0....... (xlvi). 
As illustration if n = 25 and p = -6 
622 — 483-122 + 71-5392 = 0, 


giving z= -148,351, 
or, po? = tp = -385,164, 
and * = -64194, 


a value* in excellent agreement with the results on p. 337, and needing no further 
approximation. 
Again suppose n = 5, and p = 6, we have 
622 — 7-12z + 1-5552 = 0. 
Hence z= -288,6295 and p,? = °537,2425 leading to * = -895,404 as our 


approximation. We shall now use this value of py? to determine the true system 
of H’s corresponding to this value. 


Wehave (1 — po!) By = (20 att p, 

1 
while i (xlvil 

7 ,oc0s a) Saas 
= 1-091,8073. 
Substituting in 
Si 
nm X °T11,3705E, 1 = (2m — 1) x +587,2425 + a ; 


we obtain the series 
E,, = 1-091,8073, E, = 1-776,5988, 
E, = 1-786,2042, #, = 1-911,8858, E, = 1-947,6088. 
The values show us that #, = #, was naturally much rougher in this case 


than that of n= 25. However we find e=+-001,1177, p,.? = -538,3602 and 
* = -897,267, as our next approximation, involving no very great change. 


* Repeated use of Eqn, (xli) only modified this result to F = “641,939. 


Biometrika x1 23 
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To confirm this value of * we take as the first approximation to * the value 
given in the method of the following section, ie. * = 91344 giving p,? = 548,064 
and 


m x -699,6259E,,41 = (2n — 1) x -548,0640 + _ 
Using (xlvu1) we find #, = 1-103,9149 and hence ‘ 
Ey = 1-822,4446, E, = 1-828,4768, 
E, = 1-957,1738, E, = 1-994,3056, 
leading to e = — :008,8172, 
and Po? = °539,2468, 
or 7 = 89875. 


It will be seen that our two methods of approaching the true value of * still 
differ to some extent, although probably serviceable enough for practical purposes. 
Accordingly we will now make a further approximation starting from 7 = -8980 
Or po? = °5388, and we have 

E, = 1-093,5399, EF, = 1-783,0638, Ei, = 1-792,1629, 
Hi; = 1-918,2742, E, = 1-954,1949. 


These give « = — :000,4924, 
and consequently Po” = °938,3036, = 
with T= Ooi Ini, 


a value no doubt correct to four figures. 


It is clear that the process of finding the mode for » small is much more laborious 
than for n = 25 or over, because #,, is not nearly #,,,. Actually the value given 
for E’ by the simultaneous equation process from which we started is 

EF = 1-88 808i; 


which is only a rough approximation to the value #; = 1-911,8858. That method 
must therefore be followed by further approximations when » is much smaller 
than 25. 


(5) Determination of Ordinates and Mode by Expansions. 


Approximate Expression for the Ordinates. We may proceed to expand the 
Eqn. (xxxvii) in powers of 1/n or 1/(n—1). This will involve a knowledge of the 
expansion of 


ni dz 
= E : ; eke 
tn I, (cosh z— po?)"’ where p57 —ipr, 


and can be achieved by a process to which Pearson drew attention in 1902*. 


Let 1 1 @ 7 (422 + ag 24 tag 26 +a,’ +...) 


cosh z pe: i eye Nemesia (xvii). 


* Biometrika, Vol. 1. p. 393. 
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Then, if 
v = log (cosh 2 — po”) = log (1 — po”) + a'2? + ay’ 24+ a, 2° + ..., 


it follows that 
pe nla »_ 1 (dv\ 
Renton i), : ee Tl Gee 


or (cosh z — po?) - == SIME. 


and so on. 


Now 


Apply Leibnitz’s Theorem, differentiating (2s — 1) times, and we have 


: du. cy (2s — (2s 2) 2 diy 
sinh 27 + (2s — 1) cosh z 75 - == oe sinh z dato 


ds 
+ (cosh z — po?) aga cosh 2. 


Hence when z = 0 
d@v (2s — 1) (2s — 2) (2s — 3) /dtv' ‘ (Ge 7 
(2s — 1) fe ale Sica at al +o. + (1 — po’) ie) ! 


Now put s in succession 1, 2, 3, etc. and there results 


d*v dy ea GnON 
(1 — po?) (ia),— Us 3 Gal (er) ae Ug 
dv dv », (dv 
5 Gale 10 a (1 Po ) (Se), = 1 
d*v ~ (dv Be Kee : a 7 
7 (az), 3° (aa), + 21 (aaa), + 1 — 70 (es), 
dy d4y dsy ay dy 
ie et Fe 9 4 2 2 =) 
9 (is), + 8 (Gx), + 126 (es), + 36 (Gs), + p09 (Gaia), = 1 
etc., ete 
These lead to 
Fane ee es Pe ee a ROE 
RON y ‘ Pa 
Seas 16 + 13p9? + pot od (272 + 2979? + 60p9* + po°) 
Oe ae : 40320 (1 — p,2)! 


oes 7936 ++ 10841 p,? + 3651p, aL 251p,° ie Po® 


Ayo 3,628,800 (1 — py?) , ete. 


Accordingly we have, raising (xlviii) to the mth power and expanding the 
exponential after the term in a,’ z?, 


1 | 1 
(cosh z — py*)" (1 — po)” 


eee : ote 
e7 M422" 1 — nay’ z*— na, 2° — n (ag — dna?) z8 
— 1 (Ayo — Nay dg’) 2° — n (ayy — Nag! dy’ — Anag’? + bn2aq'8) 2? + ...}. 
23— 


9 


of 
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Remembering that 


Lo nz fied 2 
| e 2(1- Pi) atede = V2, / 1 — Pot 2s — 1) (25 — 3). 
= 00 ns 
we find 
fees (" dz ; Vom a) = | Z port 2 1 Ypot+ 12p 2+ 4 
"~~ Jo (cosh z— pg2)" 2. (1 — py?) n + 128 n2 
if T5po° + 90po* — 20p9? — 40 
1024n 
3675p9° + 4200p9° — 2520p9* — 3360p,” — 336 | ) : 
+ 39768n4 + etc.) ...(xlix). 
peat ait 
Ee ON ate eye \e ' 
Bit yg, SE UDP ae ee 


T 
and thus we have 


See 
Yn = AE ea P°)> Xo \P> 


i dr(en) Galen) gale Mean 


n—-4 n—4 
yee = pe ree 
where Xo (9, 7) = (1 — pry 
is symmetrical in p and 7 and 
rp +2 3rp + 2)? \ 
pi (pr) = an , pe (pr) aL ee a) ? 
y _ B15 (rp)? + 18 (xp)? — 4 (rp) — 8} 
ds (pr) 1024 ’ ( of 1), 
eae 3675 (rp)4 + 4200 (rp)3 — 2520 (rp)? — 3360 (rp) — 336 
aE 32768 


thus depend only on the product of p and r. 


We may write 


n—2 iq 4 Pile?) sleO tos (on emanee) 
Ui = rea aes yx (ps) {+ G21) Got? @=) Ge = 
ae (li1), 
where log x (p,7) = — (n — 1) log x, — log x,, 
and X= Rete ‘ 


{(1 — p’) (1 — 7°} 
_ Vn {(1 — p?)(1—r2)}2 
(1 — pr)? 


X2 


> 


both being symmetrical in r and p 
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a) F 
Table C in Appendix gives the values of log Ga log (1 — p?)?, logy, 


log yx, $1, ba, $3, ¢4, and enables the ordinates of the frequency curves to be 
calculated with considerable rapidity for n = 25 and upwards*. 
Approximate Expressions for the Mode. 


i dz 
Jo (cosh z — p¥)”’ 


Writing [,/ = 


1 V20 : Se eae es ‘i ee +), 


2 (1 — pr) n ni we 


where ¢,’ stands for 4, (pi). We now use Eqn. (xxxix) and find 


we find ee 


eat ae ea 
p—pre n—4 n 


R= (14 i SE ah ye + er ae ! a to) 


1 we deduce 


If we expand this in inverse powers of 7 : 


R=1 oy re oi? a 2p," Oa d° An py” a py =z 3hy py =a 3py! a 33) Ey 


1 (ae (n — 1)8 (n — 1)4 

Again 

ee) pee yg a ee 

n—A n 2(n—1) °° 8(n—1)? © 16(n—1)3 © 128 (n— 1)! 
Thus we have 
Wire onal aa | 2) ae 63 — by : 373 — 2446, + 16¢,7 — B2pz 
p — pi® '2(n—1) 8 (n — 1)? 16 (n — 1)3 

,_ 8987 — — 8164,’ + 192d,’ — 1284,'3 + 3844,’ do’ — 25649’ — 38445’ 


128 (n — 1) 


Bringing the first term on the right to the left we reach after substituting for 
the ¢’’s 


' ( 5(1—) , (61—p(1—#) (867 — Spr — 2p?) (1 — 7°) 
Wie af us 
feet 0G =e1) 8 (n — 1)? 16 (n — 1)8 
17606 — 195pr — 81p?7* — 50p?7* 5 
. | ipo GEE = SPREAD it ae (liv). 
256 (n — 1)! 
* The ordinates calculated by the rising difference formula were tested in this manner. For n = 25 
the accordance was excellent, and quite good enough for practical purposes at » = 10. Below this 


(lii) becomes less reliable and needs more terms. 
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This series has now to be inverted and leads after considerable algebra to 


= (1 _5(1—p2) , (61 — 101p%) (1 — p®) , (367 — 1480p? + 1273p*) (1 — p?) 
=p(14+57 Laie ) 


= 1) 8 (n — 1)? 16 (n — 1)3 
_ (17606 — 12572 7p? + 246783p4 — 143782p%) (1 — p?) | ) (lv) 
q 956 (n a 1) Tim <eleren fiiehelenereys. . 


The above series is of very considerable interest from more than one standpoint*. 
In the first place it appears that Soper’s approximation (Biometrika, Vol. 1x. p. 108) 
was not valid. He obtained 


3 (1 — p? 41 + 23p?) (1 — p? 
Fp fl ( p*) 1) + 23p*) ( Rae I 


SAG) 8 (n — 1)? Bae nt 

Thus for n= 25, p=-6, Soper’s formula gives -62811, and (lv) gives -64205, 
while the exact value is -64194. It is clear that the coefficient 3 in Soper’s 
second term of the series can never approach the % of the more exact expression. 
At first the difference was found very perplexing, especially when the algebra had 
been verified; but the solution appears to lie in the consideration that the best 
fitting Pearson curve to the frequency is not one tied down to the range — 1 to + 1. 
That curve is fitted by two moments only, but if we fit a curve by the first four 
moments and use the general expression 


Y = T | Oy V By (Bs a 3) (Ivii) 
T 2 (5B. — 6B, — 9) starts arelalaieta ove) etefoxerstansteretstatenetets F 
we obtain r= -64192 


a value close to the true value. In other words the use of the third and fourth 
moments to find the mode is far more important than fixing down the range to 
the theoretically possible values; that process determines much more quickly the 
form of the frequency curve, but it does not give nearly such a good fit as allowing 
the Pearson curve freedom to adjust itself by means solely of the first four 
moments}. On the other hand a Pearson curve determined by the first four 
moments does describe fairly accurately the frequency distributions of r for 
m= 25 and upwards: see p. 337. 


(6) Equation for Modes and Antimodes (n = 3). 


Still another method of approaching the modal value has been found occasionally 
of service f. 


* We have used the expansion in terms of 1/(n — 1) rather than 1/n as (x — 1) appears to arise more 

simply in all the formulae. The form in 1/n is 
5 (1 — p? 81 — 101p?) (1 - p? 651 — 1884p? + 1273p) (1 — p? ; 
pe ice See reac! CaN AKGD pata pune ') ay. 
2n Sn” 16n 

+ The Pearson curve determined from the range does not give good values of the frequency for 
n = 25, even when we use the true values of 7 and a, and not Soper’s approximations to these constants. 

+ It was used successfully in calculating the antimode in the case of samples of three, when the 
correlation in the sampled population was low. It gave a fairly good ‘“‘jumping off point” even for 
higher values of the correlation. 
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Starting from the equation (xxxix) 


(n — 4) PL, 4 = (6? — 4) (n-1) I, 
Jo (cosh z — p?)-1 


expand J,, and I,,_, in terms of powers of p?, the coefficients involving 


where I, = and ¢? = pi, * being the modal value of r, we may 


rola 


< dz 
|, Goah ara = | , Sint 8d = a 
We find at once 


Ey : n—Il1)n, 
(0 4) Bd + (m= 1) By + OP Bhan 
(n—I1)n(n4+1),V 
+ 1.2.3 TOP Gate 1 a0 cont ) 
» n+1 
= (9° = BY) (1) (dn + MB as tO Btu 
_ a(n + 1) (nF 2) 
| 1.2.3 OG aa ea Ronee Hi 
Rearranging and substituting p/ for 6? and noting that 
ey a 
Im+ = — m Ym-1> 
we have 
p( — 1) dn = Fra (m — 4 — p2(n — 1)2) + }(m— 1) Fpqy (2 (» — 3) — p2n?) 
Ge eT aye (ei 2) = oO 424) eter. xt vinl), 


where the form of the successive terms is sufficiently obvious, and the series con- 
verges rapidly if p be small. 

For the particular case in which we have chiefly used this equation to determine 
7, namely samples of three, * corresponds to an antimode and the equation is for 
N= 3: 

2p = — Fi (+ 4p%) — F4q9p% + Fqyp? (3 — 16" 

+ #4 g5p8 (8 — 25p%) + Fqgp% (15 — 36p2) 
+ #8 4,p5 (24 — 49p%) + #7 qgp° (35 — 64p2) 
ET EMOUC Menten mR ene Se she ues Pe PSaN McG eee ss one duneRET: (ix): 
An equation which led to * with singular accuracy and comparative ease for small 
values of p by aid of Table X for q,. 


(7) Tables and Models. 


Table A (p. 379) gives the values of the mean, of the mode, of the standard devia- 
tion, of 8, and B, and thus of the skewness of the frequency distributions ofr. It 
will be seen that long after we have reached the limit of what are usually treated 
as small samples, the skewness of the distribution of 7 is very considerable. The 
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approach to the normal curve is very slow, and the “ probable error of the correlation 
coefficient,” i.e. 67449 (1 — 12)// n as usually recorded, has very little worth. 
Models have been prepared to illustrate these points as follows: 

Model A gives for n = 2 to n = 25, the distribution of r for p = -6. 

Model B gives for n = 2 to n = 25, the distribution of r for p = -8. 

Model C gives for n = 3, the distribution of r for p = 0 to ‘9. 

Model D gives for n= 4, the distribution of r for p = 0 to ‘9. 

Model E gives for n = 25, the distribution of r for p = 0 to -9. 

Model F gives for n = 50, the distribution of 7 for p = 0 to -9. 


(Further models are in process of construction for low values of n.) 


Even the photographs of such models form a striking warning of the dangers 
which arise (1) from small samples, and (ji) from judging results from even repeated 
small samples; the modal value of the frequency distribution for the correlation 
of these will be very sensibly higher than the correlation of the sampled population. 


(8) On the Determination of the “most likely” Value of the Correlation in the 
Sampled Population, 1.e. p. 

We now turn to another point. Suppose we have found the value of the 
correlation in a small sample to be r, what is the most reasonable value p to ene 
to the correlation p of the sampled population ? 


Now we know that 
(1 — p?) 2 onus dz 
= (ny = ON So le tee Mente Soe 
Yn (n ) (1 r ) I (cosh z — pry?’ 
and if ¢ (p) dp were the law of distribution of p’s, we ought to make 


—] n—-4 


a—? s@0-9? | oe 


n—2 d 
cosh z — pr)"-1 E 


Ci 


a maximum with p, or in other words deduce the value of p for a given r from 
n-1 
d | (1—p*) * (p)dz) _ 
dp |! (cosh z — pr)"-# 


Fisher puts ¢ (p) equal to a constant and then differentiating out reaches the 
equation 


Re (r — pcosh z) dz _ 
Jo (coshz— pr)" 
which should provide the value of p in terms of 7. He solves this only to a first 
approximation, obtaining, 
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= 5 % FREQUENCY. 


ALL CURVES ARE OF SAME TOTAL AREA. 


-O-1 O 2 +03 40:4 +05 +0-6+07 +08 +0'9 +H:0 
“60-09 08 -07 -0-6 -O5 -04 -0-4:-02 -O:l 0:0+01 402 = 
CORRELATION IN SAMPLES. 


Correlation in Smail Samples, p=0-0 to p=0-5 for samples of four. Model D, illustrating forms of frequency 
passing from the rectangle to marked J-forms of curves, which occur in this case, 
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This is a lower approximation than Soper’s second approximation, in similar 
cases, and we know that even Soper’s values are not sufficiently accurate when 
m is as large as 25. Hence no very great confidence can be put in (Ixil). 


But there is another point about (lx) which is of great importance. Fisher’s 
equation, our (Ixi), is deduced on the assumption that ¢(p) is constant. In other 
words he assumes a horizontal frequency curve for p, or holds that @ prior? all 
values of p are equally likely to occur. This raises a number of philosophical 
points and difficulties. We ask: 


When we are in absolute ignorance as to p, is it according to our experience 
that all values of the correlation are equally likely to occur? We think this 
question must probably if not certainly be answered in the negative. Very high 
correlations are relatively rare, and most biometricians would find it difficult to 
cite straight away a couple of cases of the correlation equal to — -95 although they 
could cite a score in which the correlation was sensibly zero, or again about :5. 
Every biometrician is seeking high correlations, for these are for him the all im- 
portant data, but he knows how difficult and rare they are to find*. The equal 
distribution of ignorance which applies so well to many statistical ratios, does not 

~seem valid in the case of correlations. We generally know quite approximately 


* We have recently had occasion to table (a) nearly 400 correlations between characters of the 
human femur, and (4) over 300 for characters of the human skull. The distributions were very far 
indeed from horizontal straight lines, and to suppose @ priori such distributions horizontal could only 
lead to grave errors. 

y A similar problem arises in the case of standard deviations. If = be the s.p. of the sample and 
o of the sampled population, then the frequency curve for s.D.’s is (Biometrika, Vol. x. p. 523) 


as the ‘‘best value” of o. 


This was pointed out to the Editor by “Student,” and was a desirable criticism of the statement 
made (Vol. x. pp. 528-9) that the most reasonable value to give to = was the mode of the sampled 


y — 
population, i.e. to take the observed T= 2Y= —_ o or suppose 


a ee PP gt Sia ane (B). 


Equations (a) and (8) are not identical. But again (a) is based on the assumption that all values 
of the s.p. are @ priori equally likely to occur. But surely this is not a result in accordance with our 
experience! Values of o from 0 to o are not equally within our experience, and there is almost an 
absurdity in talking about a standard deviation varying from 0 to «2; are we to include all possible 
scales in this distribution? The s.p. of stature might certainly be anything from practically zero to 
infinity if we measured it first in ‘‘light years” and then in microns. Or, are we to measure our S.D.’s 
all in the same units, when we suppose the distribution of s.p.’s to be of equal probability from zero 
to infinity? How is this to be done in the case of an absolute length and an index? Given a definite 
problem, there is certainly no d priori likelihood that the s.p. will have every value from 0 to o, if we 
confine ourselves to one scale. It must practically be less than the mean value, and in most actual 
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the correlation of the characters in the population samples, and desire to ascertain 
whether a small sample of some population sinular to a known population confirms 
our experience. 


For example, we may have twenty pairs of brothers recorded for some special 
character. Our @ priori knowledge is certainly not that all correlations between 
pairs of brothers from — 1 to + 1 are equally likely to occur! On the contrary 
we anticipate a value which will not be very far from 0-5. And this @ priori 
conviction is so great, that if the small sample did not give a value which con- 
sidering the size of the sample was compatible with the correlation in the sampled 
population being near 0-5, we should suspect errors in the measurement or some 
form of disturbing selection. In such cases, and something like them appears 
to us most frequent in biometric practice, it is we think erroneous to apply Bayes’ 
Theorem. All it seems possible to do is to assume that we have drawn a value 
near the mode of our distribution, for our sampled population is much more likely 
to have a single value, that of our @ priori experience, than every possible value 
from —1 to +1. If Bayes’ Theorem confirms this value—so much the better; 
if it does not, its fundamental hypothesis is usually so unjustified that it seems 
most unreasonable to assert that it must give the most likely value of the 
correlation in the sampled population. 


The fuller solution of Eqn. (lxi) thus appears to have academic rather than 
practical value. Still certain points of theoretical interest arise in the discussion 
of both (Ix) and (xi). Let us suppose that our @ priori knowledge consists in the 
distribution of p about a mean p with a standard deviation x. It is convenient to 
take x2 = m (1 — p?), where m is an arbitrary constant. Probably « = 0, whenever 
p —1, and this suggested this form; but since m is quite arbitrary we lay no 
stress on this point. The equation to determine the most likely value of p now 


becomes 
is deo loeie)e. 
20 = PON OZ 2m (1 - p)? 
ft (heo) e 2, 
dp } 0 (cosh z — pr)” 
=f) (Ql yy? - a 
or aa p) ( aia) Dac pla — p32) tlie eee (Ix). 


m (n — 1) (1 — p”) 


Now this equation cannot in general be solved unless we know the order of the 
product m (n — 1). Certain cases, however, can be considered. If m be very 
large, i.e. if there be very considerable scatter in our past experience of p, then 


7a Pie yeaa 6 Meera) Ia! Bese atone beewind Stinban ionprcodoDoc (Ixiv), 


problems is very narrowly limited. For example we measure twenty individuals of a population for 
stature, and seek the best value of the variability of the sampled population from the result. Would 
it not be unreasonable to suppose that @ priori this variability may he equally likely to have any value 
from 0 to? Ourd priori knowledge is that it is somewhere between 2’’-5 and 3-0 and very far from 
equally likely even between these values. To justify the equal distribution of our ignorance, we should 
have to assume that we neither knew the exact character measured, nor the unit in which it was measured, 
and such ignorance can only be very exceptional in the present state of our knowledge. 
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an equation identical with what we obtain by the “equal distribution of our 
ignorance.” The same result is also reached if m be only moderately large and 
n very big. In other words “the equal distribution of our ignorance,” even if we 
really have some knowledge of the frequency distribution of p, will not lead us 
badly astray in the case of big samples. The matter is quite otherwise, however, 
in the case of small samples; unless our knowledge is very limited (m very large) 
we have no right whatever to take (Ixiv) as applying to such small samples. Indeed 
when m is fairly small p will not differ substantially from p, and the solution of 
(Ixiii) will differ widely from that of (lxiv). We may consider these cases in 
succession. 


Case (i). Very slight knowledge of p, or on the other hand a large sample. 


Here we are justified in using (Ixiv). We can attempt its solution in two 
different ways as in the case of the mode. 


Let fp be the most likely value of p and let us write pr = p,?, then 
(i= p”) rl, oa (al 


A 


or Via Pay) T, = pyly 4. 
Now let p,” be a first approximation to p,? and suppose p,? = po? + €, where 
e is small. Then 
(r? — pot — 2pote) (In! + nel’ nt) = (po” + €) (Ina + 2 —1el,’), 
“ dz 
1, =| | 
where n le (cosh 7 por)” 


Hence remembering that 
m (1 — pot) I’nga = (20 — 1) py?’ + (0 — 1) Tyas 


te (7? —— Po’) = Pon 

ud © Dana + po? (m+ 1) Ey = 0 7? = pot) Dnga’ 
or 
a= (1 ee Po") : (ie az Po’) i, = Por a 

(1— pot) — (m—1) (7? — po’) + po™{(m + 1)(1— po*) — (20 — 1) (7? = pot} B, 

Merrie e aah ask oh (Ixv), 
>, n—-2 : 
where (1 — po*) (n — 1) EB, = (2n — 3) po? + Bo es (Ixv1), 
n—-1 

and ESL wae 


Now (lxv) and (Ixvi) may be treated exactly like the corresponding equations 
for the determination of the mode. If be moderately large, we may put 
k, = es = # 
in (Ixv), and if we know p,? obtain the value of HE’, which value substituted for 


£,, in (Ixvi) gives us « and thus a new approximation. If we cannot guess a good 
value for po? (although p,? = p? is in this case usually sufficient) we can treat the 
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numerator of (Ixv) equated to zero, and (Ixvi) with H’ for #,, and E,,_, as simultaneous 
equations to find #” and p,”, and so obtain a good approximation straight off, 
when » is of the order 25, or a fair first approximation when 1 is smaller. 


Applying this we have from (lxv) 


Ey, = En—1 = po" (t? — po’). 
Hence 


(7 — 1) (1 = po*) pot = (2% = 3) o> (1? — po*) = 12) ogc er Os 
= eye ea 
of Or oa n—1—r??’ 


Se. SS 
and therefore pn x Ri oe (Ixvii). 


n—l—*?r 

Thus it will be seen that on the hypothesis of the equal distribution of ignorance 
for n = 100, the ratio of p to r will differ less than -99402 from unity. On the other 
hand if n be 5, the ratio of p to r may differ from unity by as much as -8660 does. 
For example, if n = 5, then p = -26278 if r= -3. But for a sample of five the 
standard deviation of a value of p between -2 and -3 is of the order -18 to -20, so 
that there is little to be gained by treating the observed -30 as corresponding to 
a sampled population of -26. 


We shall now proceed to determine an expansion for p. If R be the ratio of 
Eqn. (iii), we find from (Ixiv) 
par —p) L/L 
ee as 
Be pee i) n lp 


ue p?) / 1 \s (1 =% fot oe pee dy a mS 


whence on substituting 
par(y Lice Om) 1 
co 2n-1° 8 Sn we alg (n — 1) = 
and after inversion 
Natl Liar (or) (er) er ire) & 
p=r( 9 yh — Al 8 (n — 1)? 16 (n — 1) ) 


> 


This result gives us a measure of the correctness of (Ixvii), for that equation may 
be written 
TNS 


-\-F7 (1+ 3r%)(1— 79) (1+ 2r? + 5r4) (1 — 7?) 


(ah) 8 (n — 1)? 16 (n — 1) 4 .-}(xix). 
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Thus the divergence begins as early as the term in 1/(n — 1)? and (Ixvii) can 
only be trusted for rough approximations to /. 


6 


Illustration. Suppose r= -6, what is the “most likely” value of p, on the 
assumption of equal distribution of ignorance? Let n = 25, then we find from (Ixviit) 


p = 59194, 
while (Ixvii) gives -59182, an agreement adequate for most statistical purposes. 
If n = 5, and r = -6, we find 
p = °55058 from (Ixviil), 
= :56695 from (Ixvil). 
There is now considerable divergence in the two methods and another approxi- 
mation is desirable. Let us take po? = pr = 33035, then to find #,, we have 
n— 2 


(n — 1) x -890,86894,, = (2n — 3) x #33035 + E 
n—1 


E, will be given by (xlvii) and equals -885,5939, whence we determine 
FE, = 1-189,9819, H, = 1-246,8946, #; = 1-324,1082, 
and accordingly from (lxv) 
« = :001,3081, 
Vp = po? + € = °331,8581, 
and p = -553,097, a value not far removed from that found by the first approxi- 
mation. We conclude that even when v is small, quite good results will be found 


from (Ixvili) and that it is probably better to use this rather than (Ixvii) in such 
cases as the starting point for a second approximation. 


Case (ii). Close a priort Knowledge of p. 


We will now suppose m small, so that the first approximation to p may be taken 
as p. We substitute in (Ixil) p = p + and we find, neglecting squares of %, 


G oy a p ah ) Laz ct (n ar 1) pl) a ql a p? ine 2p) Qe Al nrpl ss) 


/ 


r D 


dz 
| 9 (cosh z — pr)r-2* 


Ll 


where 
This leads us to 
(1 =p") rl, a, pli 
yee Eh ae 
La(l+ cq@oy) tot Dol 0-79 nla 


whence remembering that 


n (1 ae (7p)?) Iie aa (2n an 1) rpl, tr (n ag 1) Tos 
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and writing #, = I,,/I,_,, we have after some transformations 


+p ie 
| (1 — r2p2) (1 — 2) 2, — p) 
(1= 199%) (1 + ag yy) (D0? LP) + 9B (m+ YL — 7794) — Cm — 1) (L— PPE, 
eae were (Ixx), 
where (n — 1) (1 — 12p2) E,, = (rn — 3) rp + — snide clepaneteeree (Ixxi). 


The method is now straightforward, at least for n moderately large. We put 
E,, = E.,_,= E’ in (xxi) and substitute the resulting value of H’ for £, in (Ixx), 
and thus reach the small correction on p. 

Illustration. In a sample of 25 pairs only of parent and child the correlation 
for a certain character was found to be -6. What is the most reasonable value to 


give to p in the sampled population ? 


If we distributed our ignorance equally the result would be that stated on 

p. 357, i.e. 
p = °59194. 
But, in applying Bayes’ Theorem to this case, to what result of experience do we 
appeal? Clearly the only result of experience by which we could justify this 
“equal distribution of ignorance” would be the accumulative experience that in 
past series the correlation of parent and child had taken with equal frequency of 
occurrence every value from — 1 to + 1. To appeal to such a result is absurd; 
Bayes’ Theorem ought only to be used where we have in past experience, as for 
example in the case of probabilities and other statistical ratios, met with every 
admissible value with roughly equal frequency. There is no such experience in 
this case. On the contrary the mean value of p for very long series of frequencies 
of 1000 and upwards is known to be + -46 and the range is hardly more than -40 to 
-52. We may accordingly take p = -46 and m (1 — p?) = x? = about -0004, whence 
m = :0004/-7884 = -000,507 say. 


Thus = 82-1828 and the term containing it is the dominating term in 


m (n — 1) 
Equation (Ixiii). Thus # will differ little from p. We find 
5 — 7. 1370068, — 424,959 _ 

p=? 70-034,491 + 2°790,9252,,’ 


from (xx). 
We next determine #, = LH’ from (Lxxi), ie. 
24 x -923,824E"2 — 47 x -276H’ — 23 = 0, 
which gives us E’ = 1-352,2185, thus % = :00225 and 
p = -46225, | 
a totally different “most likely value” from that obtained by “equally distributing 
our ignorance.” 
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Statistical workers cannot be too often reminded that there is no validity in 
a mathematical theory pure and simple. Bayes’ Theorem must be based on 
experience, the experience that where we are @ priori in ignorance all values are 
equally likely to occur. This is not the case in the present illustration, and we 
must use our past experience in the same way as we should use our past experience of 
equal frequency ; the appeal to this experience has here absolutely the same validity 
as in Bayes’ case and cannot be for a moment neglected. We see that our new 
experience scarcely modifies the old and this is what we should naturally conjecture 
would be the case. If we increase the size of the new sample, then ultimately 
1/m (n — 1) becomes very small, and we approach nearer the value -59194 given 
by Bayes’ Theorem. But past experience will bias the value obtained from the 
new material for a long time, and we see that according to the value of the past 
experience p may vary from -46225 to -59194. It will thus be evident that in 
problems like the present the indiscriminate use of Bayes’ Theorem is to be 
deprecated. It has unfortunately been made into a fetish by certain purely 
mathematical writers on the theory of probability, who have not adequately 
appreciated the limits of Edgeworth’s justification of the theorem by appeal to 
general experience. 


Case (ii). Past Experience a Factor, but not the Dominating Factor of Judgment. 


Cases can arise in which p = p is not a very close approximation, 1.e. when we 
have some past experience, but not a very concentrated one of like correlations. 
In this case we must return to Equation (xii), and we shall assume fr = py? + ¢, 


where py” is some fairly close approximation to pr. We shall write pr = py”. We 
find 


2 BEL eo = Pon) Wt Pool 
pal 4 = 2 NO ee E/N eae AER 
Oe Pots |po m (n — 1) (1? — po?) f Pat 


Ft pepe Ope! Go — BO? — py 
14 . I i 21 \F0 “0 AU 
PBaat fmt Ne a fa 


m(n — 1) (1? — po*) 
was deteteaees (Ixxu), 


where M(t One) lpi en — A gt (nm — AT) Deas scteweesaneees (Ixxii), 
equations which can be readily expressed in terms of L’s. 


Unfortunately the approximation obtained by equating the numerator of ¢ 
to zero and using (Ixxiii) as simultaneous equations is not very rapidly obtained 
as the resulting equation is now of the eighth order. It is better from the data 
themselves to guess a reasonable value for py)? and start the approximation from 
this. 


Illustration. The correlation between the maximum length and breadth of 
crania is not very definitely known. Its mean is about -30, but the values deter- 
mined for it range from nearly zero to -6. Assuming the standard deviation to 
be -1, what is the “most likely value” to give to this correlation in the case of a 
sample of 25 skulls showing a correlation of -50? 
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Here p = -30, m (1 — p?) = -01, and n= 25. p.2=rp=-15. We will assume 
as a first approximation to fp, p= -40, hence p92 =-20. Equation (Ixxiii) for 
n=n-—1 gives 

(n — 1) x -96H, = (2n — 3) x -20 + (n — 2)/#,_4. 
Put n = 25, and Hk’ = EF, = E,,_,, and we have to find Z’, 
23-04H'2 — 9-4H’ — 23 = 0, 
which gives #’ = 1-223,7367; from this we deduce £,,,, = 1-225,5025, and 
« = — :029,9238, 

leading to rp = -170,0762, or p = -34015. 

Starting again with p,” = -17008, we find 

23-305,7472H? — 7-99376H’ — 23 = 0, 

giving HOS AAT9;6109 and 2p — lel Sip 3579: 
Whence we deduce e = + -003,999, 
and accordingly 7p = -174,079 and p = -34816, a close enough approximation. 


But if we had “equally distributed our ignorance” we should have found* 
p = -49217. 

These results seem extremely suggestive. If we were to observe the correlation 
of length and breadth of skull in a new sample of 25 skulls, then an observed 
value of -50 would give a “most likely value” on the equal distribution of igno- 
rance of 4922. 

But no biometrician would admit absolute ignorance in such a case; the 
correlation has been determined rather vaguely and not very adequately so that 
results range from something like zero to -6. But this a priori knowledge leads 
on precisely the same basis as Bayes’ Theorem to the value p = -3482—a result 
very much closer to previous experience of the mean value, than to the observed 
result. And there are relatively few cases in which some such, if only vague, 
@ priori experience does not exist. 

In the light of the above illustrations we consider it justifiable to assert that the 
results deduced from the principle of the “equal distribution of ignorance” have 
academic rather than practical value, and we hold that to apply it without con- 
sideration of its basis to the problem of finding the most likely values of the statistical 
constants of a sampled population from the values observed in a small sample may 
lead to results very wide from the truth. 


(9) Special Cases of Frequency for n small. 
We shall now discuss individually the lowest sample sizes. 


(i) Samples of Two,n=2. Here 


* Equation (Ixvii) would give p = -49204, nearly as good practically as (Ixviti) 
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=f! f —1 
and the distribution consists of BOsma?) at r=-+1 and soehae at r= — 1*; 
THF 


TT 


or at what is the same thing } (1 + 7) and } (1 — 7). 
The moments are 


and accordingly 
fe=1—F,  py=—2F(1—7%), po, = (1 — F*) (1 + 37)... (Ixxiv). 


Hence B, = 4r?/(1 — 7r?), By = (1 577) /(1 — 72); 
and accordingly f.—- 6, -1=09. 
TABLE I. Samples of Two. 
p, value of = | Number of Number of 
correlation in oe Bee positive cor- | negative cor- 
sampled aoe er My Bi Bs relations per | relations per 
population of samples | 1000 samples | 1000 samples 
0-0 0 | 1 0 1 500-000 500:000 | 
0-1 063,7686 -997,965 -016,332 1-016,332 531-884 468-116 | 
0-2 -128,1884 -991,750 -066,827 1-066,827 564094 435-906 
0:3 -193,9734 -981,007 156,387 1-156,387 596-987 403-013 
0-4 -261,9798 | -965,007 294,764 1-294,764 630-990 369-010 
0-5 -333,3333 | -942,809 500,000 1-500,000 666-667 333-333 
0-6 -409,6655 -912,236 806,686 1-806,686 704-833 295-167 
0-7 | -493,6334 | -869,670 | 1-288,724 2-288,724 746-817 253-183 
0:8 -590,3345 | -807,164 | 2-139,534 3-139,534 795-167 204-833 
0-9 ‘712,8674 | -701,299 | 4-133,056 5:133,056 | 856-434 143-566 
1-0 1-000,0000 000,000 oo x 1,000-000 000,000 


The distributions are two lumps given by the last two columns, and are accurately 
given by Pearson’s skew frequency distributions for the relation 6, — B,— 1=0 
(see Phil. Trans. Vol. 216 A, p. 433). 


(u) Samples of Three, n = 3. 
= p? 1 dU 


PN re ira tae. 
where a=rp and U =cos-!(—2)/V1— 2 
r+1 r+p 
as before. Hence = | r?y3dr = 5 | x? Yn dx 
ve po’ s =p 


a ES 
eae j_o_ ag, 
SP Vp? — x dx 
* Since we can determine at sight whether any pair is positively or negatively correlated, this 
gives a method of determining p by simply counting the number of + 1 and —- 1 correlations in the 


ap? 


m aN eee: 
arrays, Say m, and m,, then p= cos 7 (_) and the probable error of the determination is 
My + Mp 


67449, (oe) 
Ta ON Fe) 


N being the number of pairs used. Cf. “Student,” Biometrika, Vol. vt. p. 304. 


Biometrika x1 ae 
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Now let «=psind, then 


1— p? joe 
ine = me | sin?’ an dd 
“a 
BN ACHUR SOHO) 
Now sind ee Woe 
ES ones 
hence Ty a * sin? os =>: 7, 


es ie ees me oe 14Ud¢. 


a dp 
dz 
si u-| cosh z — p sing’ 
l—p?d (~ +3 gine 1d 
—— ——dd.... 

cas Ee 7 call de | ah ea $ 
Write poti¢ and coshz= 4, 

Sey ye ies) po ( _ |)\p—l p-1i,d 
we have fi eat | dz | (] Ase ae (Ixxv). 

a dp!y 0 7 + pcos wp 


Let p= 1, then we find 


ieee ae Gp ee 
: 7 dp | 0 log +p)+q— p) tan? 5x 
But 
fe d (tan 3%) pale nea tangy \1"_ 7 1 
!0(7 +p) + (q — p) tan? dy |v? — p* aie 2/7? — p? 
7 U Ress tas 
; at dz 
Thus ba = (1 — p*) ap I ‘/qiapt 
Now take i ame cosh z, 
hence CORD dd’ = sinh zdz, 
sinzip: 
= V7? — 1dz, 
thus dz = — cosec ¢'dd’. 
; d [3 dc’ 
It follows that fi, = (1 p>) Dp lier ar nea 
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7 
2 


7 ra, d 
ctran iene F (i °) =| Ta 
and E(t, 3) = f Via sas bad. 
a (p, 5) 
Then we have fy = (1 — p?) 5 — 
But Se et say 
where ki =V1—#, 


FE, and F, denoting the complete elliptic integrals *. 
Thus finally 
p= ; Maa" (Al aap NBL os tenga guec cunts ox ean: (Ixxvi), 


and 7 is known, as soon as p is given, from tables of the complete elliptic integrals. 
Returning to Equation (xx) and putting n = 3 we have 


p® 


4 8 
F+o+E...) 


ee =p ee + 
ve = mo PK 9 3 
al os Peet ey eigen 
aay aa \-p — 9 (p*)* — 3 (p?)? — 3 (p*)* — 
1l1-— 


—- Pee, (Ea PP Piaa se aetciys cyotroaeiabesas sees sseeee (Ixxvii), 
p 


=1+5 


and further 


2 
loo (dl a2 watesewecaeea (Ixxviil). 


Ti— 
OF = p= l-P +s # 


We now turn to the third moment and may anticipate a recurrence of the 
elliptic integrals. We shall obtain our result on the whole most briefly by 
appealing (o Equations (xxii) and (xxi) on pp. 335 and 336. We have 


(x 
: (1- 33, .\ 1, p(B 8 a a 
is eed ace 9? 2, p*) 3% F(5, 5? 3, p*)> 


a 


or since I, = 


(3.5... 28 + 1)? »/] 
8 Se@teraT? 


2 
ee eco cea aera a 


* Cayley, Hlementary Treatise on Elliptic Functions, p. 48. 


24—2 
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oe ae 


2 2 


CEO CEG ey a 
Dy. 25 (SEE 2)ii(s a2) @ sco ee 


_(l—p)ad {s (8.5... 28 +3)? s+1 pit 
~ 2p? Qdpl” | 8 (e472) Gio) Were gs 22 
-- 


or 


where E, and F, are as before the complete elliptic integrals. 


In order to obtain the fourth moment coefficient about zero we will return to 
formulae (xx) and (xxi) of pp. 334-5 and write 


, n omer “| 
py =1— aia] (l= p*) fo 
, , n(n — 2) 
= Yu. 1s SAR No 22 f 
and Ha B2 it i (n aE 1) (n = 1) (1 P ) Ta 


where f, and f, are the hypergeometrical series. 
Now the general term of f, is 
(2. 4e618\.. 25-4 Depa 
(s+ 1)! (m+ 1) (n+ 8)... (v1 + 28 4 1) 2541’ 


and the general term of /, is 


(4.6.8... 28 + 2)? p% 
(s)! (w+ 3) (nm + 5)... (m + 28 + 1) 28° 
Hence it follows that Dats atae: ae 
4p dp 
or we have 
ae) " n(n — 2) aoe. 
areal mecaiien: eager oT) Dae siotekererecatets (Ixxx) 


Thus if we are able to sum f, algebraically, we can determine ji’ algebraically * 
* The corresponding formula for pu,’ and p,’ = 7 is . 
fis’ = F — (1 — p*) (n ~ 2) fe @ eect ee (Ixxxi). 


If we put » = 3 and P= = (By ~ (1 ~ p*) Fy) we find 
,_2- pp 2(1- 6) 
~ pa olan ae 


Fy, 


confirming the result in Eqn. (Ixxix). 
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Putting n=3 and writing f, = —s log, (1 — p?), we have 
3 (1 — p?)? d {oe (eS eI 


Pee tae al oe lop, (ls— p?) — = 
(a p2 UNS aay 8 9p dp p 


(Ixxxil). 


iyo? 
= L4GP Bp? + (3 + p) Togs (1 p29} creeeseeseeerrnenn 


This completes the moment coefficients for samples of three. 


As p may be determined by considering the ratio of negative to positive cor- 


relations in samples of two, so it may be determined by considering the ratio of 
positive to negative correlations in samples of three. Let m, be the number of 


positive and m, the number of negative correlations, then since 
1—p? 1 1dU 
la 7 V1—r27 dp’ 
see l1—p?d ies Udr 
eee 7 dp!o rV1—7 
ep? a Ke cosq! (— pr) dr 
0 rV(L— 7?) (1 — 7292) 
Sey pi heit ees ie 
ee ee 1—p2 d | cos G pr) di 
a7 dp. 


a dp. 


-1rV(1 — 92) (1 — 2p?) 


Now put in the latter integral r = — 7, then 
1—p?d li — cos—! (pr) dr 
Ny ——.. 
a dplo ¢V(1—r%) (1 — rp?) 


But cos“! (— pr) + cos! (pr) = 7, hence 
My — My neu ] oy dr 
My +m a p M7 »4/ 2) (1 — 7292 
p + Mn Pe iN (WF) (= 7p?) 
pd rodr 
=(1=p)| 
ie) Val at 


orifr=sing, = (1 — p?) fe fo Jeo ds ae 
“0 (1 — p? sin? hd)? 


2 d (p cos ) 
0 (1 =p? + p? cos? fy? 


Say 


Or again if pcos ¢ = V1 — p? tan 0, 
rsin™Pa/T — p2cos?@ dO 


Mp — mM 
ae p?)? cos?0 


= eee 
My + My aa ay 


0 


sin71p 
= i cos 0d0 = p, 
0 
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or p= (Ng =m) Mp EM, ence ace ee (Ixxxiii), 


a very simple formula if m, or m, has been found. 


Clearly p= (2m, —N)N, 
and Sp = 28m,/N, 
Ope => 40° 1 = 4 ae ( a ) 4 
Thus the probable error of p found in this manner is 
67449 My My) : 
ee) — 2? Seba 
Sana ie (1 i) wih ashacecnhe eee (Ixxxiv), 
and can easily be evaluated, for it gives: 
674499 ee 
Probable Error of p = - VN V1 =p? Se eee (Ixxxv) 
We see it is larger by the factor a 5) which is greater than unity, than the 
=p 


usual value for the product moment process of the correlation. But a new point 
arises: N is the number of triplets in the present process, and N the number of 
individuals in the product moment process. If we take M triplets and N individuals, 
we have to compare 


67449 V1 — p?/VM_ with -67449 (1 — p®)/WN, 
and these probable errors will be equal if 
M = Nj(\ —p?). 
If the number of triplets be > N/(1 — p?) the triplet process will be more accurate 


than the product moment method. The number of triplets required for equality 
of probable errors are for the various values of p: 


A PS p=D M=1-3338N, 
p=:1 M=1-010N, p=-6 M=1-563N, 
p=-2 M=1.042N, p=-T M=1-993N, 
p=:3 M=1-099N, p= 98 = 2178 Ne 
p=-4 M=1-190N, p=9 M=5-263N. 


This series would seem to suggest, since a triplet contains three individuals, 
that to use the triplet process with equal exactness with the product moment 
process, in the case, say, of p = -5, we should need a population of 4N. But this 
assumes that each triplet is based upon three independent individuals. Actually 
a population of N provides +N (N — 1) (N — 2) triplets and if these could be 
considered as an independent sample of M triplets, we should have a less value of 
the probable error of p by the triplet process using all possible sets than by the 
product moment process, provided ; 

pests W=DW=3)" 
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For example if N = 10, for all values of p between + -957 and — -957, the 120 
triplets will give a better result than the 10 individuals. Even 50 triplets would 
be better than 10 individuals for all values of p between + -894 and — -894. But 
the question arises whether we can consider the 120 triplets from a sample of 
10 individuals as much a random sample as 120 triplets from an indefinitely large 
population, and this can hardly be the case. It may be, however, that 50 triplets 
out of the 120 would be sufficiently independent to give a better result than 10 
individuals. It is very desirable that a full study should be made of such restricted 
sampling, for without such study it is not possible to assert how far the probable 
errors of doublet or triplet procedure are greater than those of the product 
moment method. 


Of course in such a case as that referred to, the labour of the triplet process will 
be considerably greater, for we have to determine the sign of the correlation in 50 
or 120 cases, instead of applying the product moment process to 10 individuals, 
and the labour rapidly increases with increase in the size of the set (doublet, triplet, 
etc.) and the size of the sample. Still the labour may be worth while in the case 
of small populations, where the best result is of considerable importance. We 
have not endeavoured to extend the theory to quadruplets or quintettes, because 
the labour of determining the sign of the correlation in these cases is very con- 
siderable. 


In the case of triplets, we require the sign of the product moment 
LyY1 + LeYo+ @3Y3 (Ly + Lo + Ws) (Yr + Yo + Ys) 


3 9 , 
or the sign of (%_ — Xj) (Y2 — ¥) + (3 — 24) (Ys — 9). 

Now suppose the triplet arranged according to the character x in ascending 
order %, %, %3, then x, — x, and x; — a, will always be positive, and accordingly 
if either both y, — vy and y; — ¥ are positive, or both negative the sign of the 
correlation is obvious. On the other hand if y,— y and y;— 7 are of opposite 
sign, the matter has got to be a little more carefully considered. But if 7 has been 
found and the above differences determined, in most cases it is not needful to 
actually multiply out, in order to realise the sign. A graphic process depending 
on the plotting of the triplet triangle seemed on the whole more laborious than 
the above. 


Of course in samples of three the U-shaped distributions give a minimum where 
dy/dr = 0, and we have therefore an antimode, not a mode. The values of this 
antimode are recorded in Table I, p. 368. It will be seen that all the antimodes 
are negative for positive correlations in the sampled population. The antimode 
asymptotes to the value — -613,9616, which it reaches when p= + 1. In this 
case the value actually fails at p= -+ 1, for the equation dy/dr = 0 is satisfied 

n-1 
then for all values of 7, owing to the presence of the factor (1 —p2) 2 : see 
Equation (xxxvii). Nevertheless the antimode curve goes right up to the point 
indicated above and this value must be used for the purpose of interpolation 
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Fig. 2. 


Mode (Antimode) and Mean Curves for n=3, and for values of p from 0 to -+1. 


sy 


| Pee | 
PRR eZ 
| SE? 
HR ye 


Value of Population Correlation. 


-10-9 -8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 -6 -7 8 9 10 
Value of Correlation in Sample. 


between p = -95 and p=1-00. The curve is shown in Fig. 2, and both antimodal 
and mean lines on the photograph of the model of the frequency surface. 

The distributions are after p = °2 very skew U-shaped frequency curves, whose 
B,, Bo lie in the U-area of Pearson’s skew curves, which, however, do not 
reproduce the antimode very closely. The ordinates are given in Table A (p. 379). 


TABLE II. Samples of Three. 


Number of| Number of 
p, value of | — é positive | negative | 
| ¢ correlation | Beak r, antimode B B correla- | correla- 
in sampled Bee ee of samples %% : 2 tions per | tions per 
population| ™~ P 1000 1000 
| samples | samples 
0-0 -0000,0000 | -000,0000 | -707,1068) -000,0000| 1-500,0000 500 500 
0-1 -0786,3836 | —-151,2541 |-704,5029| -028,0136| 1-532,3082 550 450 
0-2 | +1578,7706 | —-275,5141 | -696,5708| -115,2406| 1-632,9424 600 400 
0:3 | -2383,6407 | —-367,9037 | -682,9273|  -272,2374| 1-814,1763 650 350 
0-4 | -3208,5431 | —-435,4082 | -662,8536| -520,5707 | 2-101,1480 700 300 
0-5 -4062,9889 | —-485,5321 | -635,1363| -901,7817| 2-542,3242 750 250 
0-6 -4960,0160 | —-523,8811 | -597,7313| 1-500,1840| 3-236,2938 800 200 
0-7 -5919,3885 | —-553,8751 -546,9866 | 2-510,5375 | 4-411,4204 850 150 
0-8 -6975,5118 | —-577,9216 | -475,4818| 4-503,0667 | 6-738,8238 900 100 
0-9 | -8204,3635 | -—-597,5916 | -363,4654 | 10-222,6204 | 13-467,2160 950 50 
0:95 | -8742,5455 | —-606,1358 | -268,4676 | 21-078,0376 | 26-340,9890 975 25 
1:00 | 1-0000,0000 | —-613,9616 | -000,0000 er) ore) 1,000 0 
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(ii) Samples of Four, n = 4. 


Here, if x = 7p, 


— Aa - p?)? d? = (— Ey ; 
Ya ar dx? Va > 
and if U = cos} (— 2)/V1 — 2, 
7 pif, dv 
then Ma ( ia {| a ale a +q(q-1) ie "a *U da : 
To find the first four moments about 7 = 0, we have to determine 
rtp +p +p 
| x?Udz, xU dz, Udz, 
. "|p: . 14) r =p) 
,adU 2 dU 2 dU dU 
Pa tee oe U, ae 8a us we, — 20U and wa — Uz 


The results are most briefly expressed by using Fisher’s notation p = sina and 
remembering that 


cos-1(— p)+ cos 1(p)=7, and a= cos! (— p) — 
For the purposes of integration we put = — cos @ and integrate in terms of 0. 
We find 


+p : 
x?Udx = 47 {a — cosa sin a}, 
‘=p 


rte ; 
| xU dx = 2 {sina — a cos a}, 
Jp 


p 
Udx = ma, 
—p 
and ins 
Pee) api ae 4 sin a 
xv costa cos a 
dU sin?a sinta sin2a 
2? — — 322U = 2 a -— 30——— 
di. cos?a | cosa cosa)’ 
a dU onU singa 5 sin a 
a, = 7 
dx cos’a cosa)’ 


dU ie ae sin2a 1 


3 a 3, a . 
cos?a cos? COs a 
Hence substituting in the several values of y,’, we obtain* 


TP Niny 2 {eota a (1 = COLA) crcavecccseseness (Ixxxvi), 


* Tested by formulae (Ixxx) and (Ixxxi), p. 364, which can be put in the forms 
(n = 2) (1 = 6) dy 
p dp 


Mg = by’ {(m — 2)/p® — (1 — 3) — AAA ree ecec ccc cc eee teeees (xe), 


and Ba = (nm — 2) — 3 (nm — 4) pe’ - tn} = 
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O41 oh — 2:CObAa/ = Da COLI a: ian cascaeoectesseieseerene 


1— (Ixxxvil), 
eo = {cot a + 6 cot®a + a (1 — 3 cot?a — 6 cot*a)} 

Seruahods (Ixxxviii), 
Uy = 1 — 4 cot?a — 6 cotta + a (6 cot®a + 6 cotea)* 
tee (Ixxxix). 


From these formulae were calculated the values given in the following table. 


TABLE III. Samples of Four. 
- Usual value 
easel aime er I enen pac ge 
1c 5 Vv 2 
of sampled | correlation One | Ms ie ei By Be 62/61 
population | in samples pase ees 
/n-1 
0-0 0 0 -577,3503 | -577,3503 0 1-800,000 ora 
0-1 -084,9678 | —-033,6268 | -574,5653 | -571,5768 031,429 1-:839,929 | 58-5418 
| 0-2 -170,4532 | —-065,2863 | -566,0965 | -554,2563 -129,510 1-964,665 | 15-1700 
| 03 -257,0089 | —-092,9620 | -551,5835 | -525,3887 -306,862 | 2-190,708) 17-1391 
0-4 -345,2652 | —-114,5383 | -530,3576| -484,9742 -589,510| 2-552,205| 4-3294 
0-5 -435,9911 | —-127,7520 | -501,3081 |) -433,0127 1-028,270| 3-116,256| 3-0306 
0-6 -530,1976 | —-130,1567 | -462,6087| -369,5042 1-728,423 | 4:-022,982| 2-3275 
0:7 -629,3378 | —:119,1407 | -411,1087) -294,4486 2-940,226| 5-609,288) 1-9078 
0:8 -735,7362 | —-092,1708 | -340,7311 | -207,8461 5:428,946| 8-922,221] 1:-6435 
0-9 -853,9806 | —-048,1281 | -236,6586| -109,6966 13-184,043 | 19-571,006| 1-4844 
0-95 -920,8889 | —-021,4678 | -157,7942 | -056,2917 | 29-8558 43-4082 1-4539 
0-98 -965,7599 | —-006,4363 | -088,2666 | -022,8631 87-5994 130-1935 1-4862 
0:99 -982,1321 —-002,4358 , °055,4859 | -011,4893 | 203-3250 311-7316 1-5332 
1-00 1 0 | 0 0 oe) oo) ae 


* These expressions cannot be applied to the case of p= 0. We must return to Equation (ix) and 
put p = 0, finding y, = $N or a horizontal line. 
+ The ratio B,/8, equals in the limit (277? — 256) (372 — 16)/(972 — 80)? = 1-8305. This may ke 


shown by putting a= 47 - c, or p= 1- }e2, or e= V1 — p?, where ¢ is small. We then find for 
values of p near unity 


256 aye 
3s) (1 et. 


256 
(9 - =) 


These lead to 


and give the above result. 
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(iv) General Case of small Samples, n> 4. 


Equations (xxviii), (xxxi), (xxxil), and (xxxiv) enable us to express the moment- 
coefficients of a sample of m + 2 in terms of those of a sample of n. But we have 
found algebraic expressions for the moment-coefficients for n = 3 and n= 4 in 
terms of (a) the complete elliptic integrals and logarithmic functions, (b) trigono- 
metrical functions of a and cota. Hence all even samples can have their moment- 
coefficients expressed in terms of a and cota, and all odd samples can have their 
odd moment-coefficients expressed in terms of the complete elliptic integrals and 
their even moments in terms of logarithmic functions. The former result has been 
already noticed by Fisher*. The arithmetical calculation of the successive moment- 
coefficients after n = 4 by the difference formulae is, however, shorter than obtaining 
the algebraical expressions and then substituting arithmetical values, and has 
been followed in our calculations. 


(10) Approach of the Distribution as n increases to a Normal Character. 


It is well known that for the “ probable error” to have meaning the distribution 
must approach the Gaussian for which £, = 0, 6B, =3. It is clear that these con- 
ditions are by no means fulfilled for samples of 25 or 50, whatever be the value 
of p. There is nearer approach in the low values of p in samples of 100, but there 
is considerable deviation for p = 5 and upwards. 


TABLE IV. Values of the Frequency Constants for the Correlation in 
Samples of 25. 


F Actual #*f vs A | 1- 
P mean mode wees t Actual o Vial Bi Bo 
0-0 0 0 0 -2041,241 | -2041,241 0 2-769,2305 
0-1 -0979,577 11173) | -11127 -2022,954 | -2020,829 -012,3106 | 2-791,6002 
| 0-2 -1960,288 +22258 -22177 -1967,883 | -1959,592 -049,8655 | 2-860,0511 
| 0:3 +2943,287 *33172 +33090 -1875,386 | -1857,530 -114,6242 | 2-978,8302 
0-4 °3929,765 ‘43840 | -43758 -1744,356 | -1714,643 -210,1771 | 3-155,8537 
| 0-5 -4920,974 54197 | -54149 °1573,152 -1530,931 342,3386 | 3-404,2283 
| 0-6 -5918,251 64194 -64190 -1359,499 — -1306,395 -520,2635 | 3-745,3432 
| 0-7 *6923,054 °73792 *73826 -1100,322 — -1041,033 °758,5549 | 4:214,8982 | 
| 0:8 -7937,001 *82966 *83025 -0791,481  -0734,847 | 1-081,1286 | 4:869,2635 _ 
| 0-9 -8961,933 -917038 -91736 °0427,345  -0387,836 | 1-533,4124 | 5-858,3872 | 
| 1:0 1 1 1 0 0 ore) ora 


It will be realised that while the ordinary value for the standard deviation of 
r and the distribution of by a normal curve is fairly close for samples of 400, there 
is still a quite sensible deviation from normality in the case of p = -8 or over. In 
fact it may be said that for the size of ordinary samples, there is always a sensible 


* Fisher, Biometrika, Vol. x. p. 516. 
+ See Sections (4) and (5) above. 
t= FH sy (Bs + 3)/1up (108, - 128, - 18)}. 
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TABLE V. Values of the Frequency Constants for 


Samples of 50. 


the Correlation in 


A | | 
; eee yfrom | 1- p? | 
Pearson’s | Actual —— | | 
p mean mode . PRES ey Sia V/n-1 | By Be 
: ‘< ce ae | 
0-0 0 0 0 -142,857 | -142,857 0 2-88236 
0-1 098,995 -1054 1053 -141,505 | -141,429 | 00666 289184 
0-2 -198,047 +2104 2102 137,439 | -137,143 | -02683 2-93350 
0-3 297,218 3147 3144 -130,634 | -130,000 | -06107 2-99909 
0-4 396,565 -4180 4177 -121,049 | -120,000 11041 | 309417 
0-5 -496,150 -5198 5196 -108,620 | -107,143 17635 3-22240 
0-6 596,038 6201 6199 093,260 | -091,429 -26110 | 338912 
0-7 696,295 | -7184 7183 074,878 | -072,857 | -36774 | 3-60222 
0-8 ‘796,989 | -8146 8146 *053,324 | -051,429 50037 3-87312 
0-9 898,198 9085 9085 028,434 | -027,143  -66608 4-22186 
1-0 1 1 1 0 0 oe) | io 
TABLE VI. Values of the Frequency Constants for the Correlation in 
Samples of 100. 
= | : Yr from = 
P Actual 7 i oreemersie 1-p 
p mean | mode Hace ET Vn—1 Bi Be 
ON ee ee | — z 
0-0 0 0 0 | :100,5038 | -100,5038 0 2-94060 
0-1 -099,5016 | -10258 -10255 =|: -099,5260 | -099,4987 00346 2:94736 
0-2 -199,0319 | -20499 20494 -096,5887 | -096,4836 01390 2-96774 
0-3 -298,6219 | -30708 30701 =| _ -091,6832 | -091,4584 03147 3-00213 
0-4 +398,3013 -40868 -40860 -084,7934 | -084,4232 05644 3:05118 
0-5 -498,1002  — -50964 50957 -075,8968 | -075,3778 08919 3-11583 
0-6 -598,0498 | -60982 -60976 :064,9640 | -064,3224 | -13025 3-19739 
0-7 -698,1815 | -70907 -70903 °051,9577 | -051,2569 18031 3-29767 
0-8 *798,5279 | -80726 80724 :036,8329 | -036,1814 | -24027 3-41896 
0-9 +899, 1225 | 90427 90423 -019,5352 | -019,0957 | -31148 3-57898 
1-0 1 | 1 1 0 | 0 | 0 oe 
TABLE VII. Values of the Frequency Constants for the Correlation in 
Samples of 400. 
lar f 
r Metuall (| vue eceeal : 1- 

P mean mode | Pe fe oar Vn—1 Ai Be 
0-0 0 | 0 0 -0500,626 | -0500,626 0 |  2-9850 
0-1 -0998,760 | -1006,250 | -1006,232 | -0495,654 | -:0495,620 | -00089 | 2-9868 
0-2 -1997,595 | -2012,116 -2012,082 | -0480,733 | -0480,601 | -00357 2-9921 
0-3 -2996,579 | -3017,217 | -3017,171 | -0455,851 | -0455,570 | -00804 3-0010 
0-4 -3995,788 | -4021,171 | -4021,115 | -0420,988 | -0420,526 01433 3-0138 
0-5 -4995,297 | -5023,602 | -5023,584 | -0376,115 | -0375,470 02250 3-0297 
0-6 -5995,181 | -6024,134 | -6024,089 | -0321,195 | -0320,401 03245 30498 
0-7 -6995,517 | -7022,401 | -7022,386 | -0256,183 | -0255,319 04435 3-0725 
0-8 -7996,380 | -8018,037 | -8018,013 | -0181,023 | -0180,255 05820 3-1017 
0-9 -8997,849 | -9010,725 | -9010,668 | -0095,653 | -0095,119 07402 3-1342 
1-0 1 1 1 0 0 ora ore) 


ee 


F + My (By + 3)/{ Hy (LOB, — 126, - 18)}. 
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deviation from normality for high values of p in the sampled population, and 
the usual “probable error of 7” must be treated with caution in these cases. 
Pearson’s curves would give better results, as is indicated by the agreement of 
¥ to five figures. 


(11) Table for determining the Mode ¥ of the Frequency Distribution of con- 
siderable Size n when the Correlation in the sampled Population is known to be p. 
The required value of the mode is 


et v; (p) ve(p) , 3 (p) 
Carman Gas t (a= 


when p is positive; if p be negative * has the same value as for p positive, but with 
opposite sign. 


TABLE VII. Functions required in determining the Mode 
of a large or fairly large Sample. 

p v1 (p) ¥2 (p) v3 (p) va (p) 

00 0 0 0 0 

05 -12468,75000 + -37872,26953 + 1-13249,90199 + 3-36913,85919 
“10 -24750,00000 + +74237,62500 +2-18002,51688 + 6-33154,83878 
15 -36656,25000 + 1-07636,49609 + 3-06395,34936 + 8-53428,44284 
20 -48000,00000 + 1-36704,00000 + 3-71804, 16000 + 9-72192,80640 
25 -58593,75000 + 1-60217,28516 + 4:09383,77380 + 9-77504,47690 
+30 -68250,00000 + 1-77142,87500 +4-16514,90563 + 8-72821,01391 
35 -76781,25000 + 1-86684,01172 +3-93125,67461 = +6-°77045,11170 
40 -84000,00000 + 1-88328,00000 +3:41856,48000 | +4-22414,17680 
“45 -89718,75000 + 1-81893,55078 + 2-68036,91046 + 1:50278,32653 
-50 -93750,00000 + 1-67578, 12500 + 1-79443,35938 — *94981,38428 
3) -95906,25000 + 1-46005,27734 + °85806,01815 — 2-73364,05345 
60 -96000,00000 + 1-18272,00000 — -01966,08000 — 3:57140,42880 
65 -93843,75000 + :85996,06641 — :72873,29366 — 3:38017,64160 
“70 -89250,00000 + :51363,37500 —1-17258,21188 | -—2-31971,93665 
‘75 -82031,25000 + -17175,29297 — 1-28614,42566 | — -79706,28440 
-80 -72000,00000 — +13104,00000 — 1-05802,56000 | + +59787,92160 
85 58968, 75000 — +35300,16797 — -55704,89418 + 1-25603,36730 
-90 -42750,00000 — :44481,37500 + :03650,10188 | + -84819,10191 
95 -23156,25000 — :34910,94141 + +39461,98756 | — +19874,71356 

1-00 0 0 0 0 


The above Table will give the value of * correctly to about the sixth figure if 
n = 100 or more, to about the fourth figure if m = 25 or more. Below 25 it can 
only serve as a “taking off point” for more accurate approximations, and these 
are fairly troublesome if n be very low. It will be found best to interpolate for 
the expression to be added to p. 


To find the modal 7 for samples of 9 when p = :2852. 
* =p ++106,272, for p = -25, 
¥ =p + -121,126, for p = -30. 


Illustration. 
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Hence * = p+ -116,729, for p = -2852 
= -4019, say. 
We cannot, however, be certain that this is correct to more than two figures. 
Equation (xlvi) gives us 7 = -4038. 


We will therefore start with * = -4030, say, as the basis of a more elaborate 
approximation, or po? = -1149372 say. 


Hence calculating J, and J, and using the difference formula we find 


1, = 1-6972,3599, I, = 1-0465,6745, 
I, = 1:2110,7453, T, = 1-0879,1327, 
I, = 1-0715,7031, T, = 1-1443,9624, 
I, = 1-0262,1201, I, = 1-2145,9528, 
I, = 1-0236,1262, To = 1-2980,8251. 


Thus the equality of Z, and J,) has not been reached, so that we could hardly 
anticipate (xlvi) giving a very good result. Using (xh) we find 


e = + -0006,0228, 


a sufficiently small correction, leading to pp? = -1155,3948, and * = -40512, correct 
to the fourth figure. Table VIII for n = 9 gives # in error by about 0°8 %. 


(12) Table for determining the ‘‘most probable” value p of the correlation in a 
sampled population from the knowledge of the correlation r in a sample of size n, when 
n is considerable and it is legitimate to distribute ignorance equally. 


The required value is 


ty Ay (7) Az (7) As (7) ‘ 
p n—-1 (n—1l)?® (n—1)’ 
when + is positive; if 7 be negative, p has the same value as for r positive, but with 


opposite sign. 


The above formula using Table IX will give p correct to five figures if nm = 25 or 
over, and correct to four figures if n = 10 or over. 


It appears best to interpolate not for the separate A-functions, but for the total 
value to be subtracted from r to find p. Thus, suppose we require to find f for 
y = -6781 and forn=16. We have for r = -65 


6 = 65 — -0127,3515, 
and for r = -70 6 = -70 — -0121,8204. 
0 


Therefore a difference of — -0005,5311 corresponds to a rise of -05 and accordingly 
one of — -0003,1085 to a rise of -0281. Thus 


p = °6781 — -0124,2430 
= :6657, accurately. 
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TABLE IX. Functions required in determining the “ most probable” 
value p of the Correlation. 

r A, (7) Ay (7) Ay (7) 
-00 0 0 0 
05 -02493,75000 — :00615,64453 — -00317,92000 
10 -04950,00000 —-01175,62500 — 00667, 19813 
15 -07331,25000 — -01626,62109 —-01073,47255 
20 -09600,00000 — -01920,00000 — -01551,36000 
25 -11718,75000 — -02014,16016 — -02099,99084 
30 -13650,00000 —:01876,87500 — :02699,79938 
“35 -15356,25000 — :01487,63672 — -03310,98746 
“40 -16800,00000 — -00840,00000 — -03874,08000 
“45 -17943,75000 + -00056,07422 — -04312,99059 
-50 -18750,00000 +-01171,87500 — -04541,01563 
55 -19181,25000 + :02457,59766 —-04470,17533 
-60 -19200,00000 + -03840,00000 — -04024,32000 
65 -18768,75000 +-05220,05859 — :03156,41987 
‘70 -17850,00000 + :06470,62500 — -01870,45688 
“75 -16406,25000 +-07434,08203 — -00248,33679 
-80 -14400,00000 + -07920,00000 +-01517,76000 

es) -11793,75000 +-07702,79297 + -03087,17070 
-90 -08550,00000 +-06519,37500 +-03926,26688 
“95 -04631,25000 + -04066,81641 + -03257,27752 

1-00 0 | 0 0 


(13) On 


the Table for qn, = | 
0 


wl 3 


sin’! ddd. 


While Table X, p. 377, gives the value of q¢, = | “sin"-1 ddd to ten figures, 
“ 0 


and therefore will be of use in calculating the values of the moments, the 
reader may be compelled to deal with values of n greater than those tabled, or 
even may need more than ten significant figures. The present values were 
calculated to twelve figures, by means of the simple relations 


P2042 = Top — 


ater, 
pea 2? 


PVon+41 = Yon-1 — 2p Vop-1> 


: , 1 : ‘ 
with the control relation q, X qep-1 = Bag 1 , and the occasional direct calcula- 


tion of individual q,’s to control the accuracy by use of Degen’s Tables* and 
Briggs’s Arithmetica Logarithmicat. 


(2p)! a 
Since Co Sane 5 See eer erie tr errr (xcil), 
22? {(sp) !}? ere? 
and oS OME a ee ee (xcin), 


* Tabularum ad faciliorem et breviorem probabilitatis computationem utilium Enneas. C. F. Degen, 
Havniae, 1824. 


+ Londini, W. Jones, 1624. 
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and Degen gives the logarithms of the factorials up to 1200! to eighteen mantissa 
figures, there is no difficulty in getting the logarithms of the q,’s to 18 figures. 


All we need is the value of either log 2 or 5 to an adequate number of places*. 


But for modern methods of machine calculation the logarithm is of small service 
and we need in this case to find also the antilogarithm. Let us illustrate the 
process in the determination of qyoq: 
log qio4 = 104 log 2 = 31-307119,549054,044280 
+ 2 log (52!) | + 135-813296,784409,541764 
— log 104 | —  2-017033,339298,780355 
— log (104!) J — 166-012795,764264,301069 
= 1-090587,229,900,504,620. 

Thus far the work is very straightforward. But to obtain the antilogarithm 
to twelve figures is another matter. Tables like the original Vega (to 10 figures) 
or Mendizabel (to 8 figures) are not of service. We are thus compelled to use 
Briggs’s 14 figure Table of Logarithms, but the fundamental defect of that magnifi- 
cent piece of work is the largeness of the differences. The nearest logarithm to 
the above is log (12319) = 1:090575,455222,21 with the remainder 

r = :000011,774678,29, 
and the difference -000035,252606,20. Mere linear interpolation gives 
-1231,9334,0087,32, 
which is wrong in the tenth figure. 
We have therefore used the method of inverse interpolation given in the 


Tables for Statisticianst as (vii)>is on p. xiv. Unfortunately there are two 
misprints in the value given there (corrected in the Errata); it should run 


624 (— Up — Uy + U_y + Uy) + OF (Sy — 38Uq — U_y — Us) + Uy — Up (A) = 


Be esc (xciv). 
But from Briggs’s tables, 


U_, = -0905,4019,9754,24, 
-0905,7545,5222,21, 
U1 = -0906,1070,7828,41, 
Uy = 0906,4595,7573,32, 


I 


Uo 


I 


whence 
-0001,4101,614786 0 = -0000,4709,8713,16 + 0? x -0000,00005723,06, 


-4709,8713,16 0000,5723,06 


= 1.4101,6147,86 ¢ ° 1-4101,6147,86" 


or, 6 


* ™ _ 1.570796,326794,8966 ; log 5 = -196119,877030, 1527. 


2 


+ Cambridge University Press, 1914. 
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To a first approximation 


0, = 


-4709,8713,16 (1 


( aoe) 
1-4101,6147 : 


1-4101,6147 


arranging it thus as most machines cannot divide by more than nine figures. 


Hence 
O; = -3339,9517,625. 


Substitute in the 6? term and we find 
6, = -3339,9970,356, 


or the value of q494 = °1231,9333,9997 to twelve figures. Found by the continuous 
process it was -1231,9333,9996, so that the value tabled is correct to the tenth 


figure. 


TABLE X. Values of the Integral q, = |’ sin” 1 ddd = le cos"! ddd. 
/0 “0 


n dn nN lk n | In | 

= — 

1 1-5707,9632,68 36 -2103,4114,55 71 -1492,6566,48 | 

2 1-0000,0000,00 37 -2074,4030,47 72 -1482,1822,53 | 
3 *7853,9816,34 38 -2046,5624,97 les -1471,9253,06 
4 -6666,6666,67 39 -2019,8134,93 74 -1461,8783,86 
5 -5890,4862,25 40 -1994,0865,35 75 *1452,0344,23 
6 -5333,3333,33 41 -1969,3181,56 76 -1442,3866,74 
ih -4908,7385,21 42 -1945,4502,78 77 -1432,9287,07 
8 -4571,4285,71 43 *1922,4296,29 78 -1423,6543,80 
9 -4295,1462,06 44 -1900,2072,49 79 *1414,5578,26 
10 -4063,4920,63 45 -1878,7380,46 80 -1405,6334,38 
Il -3865,6315,85 46 -1857,9804,21 81 | +1396,8758,53 
12 -3694,0836,94. 47 -1837,8959, 15 82 | -1388,2799,39 
13 -3543,4956,20 48 -1818,4489,23 83 -1379,8407,82 
14 -3409,9234,10 49 -1799,6064,16 84 =| --1371,5536,75 
15 -3290,3887,90 50 -1781,3377,19 85 | -1363,4141,06 
16 -3182,5951,83 51 -1763,6142,88 86 | -1355,4177,49 
17 -3084,7394,91 52 -1746,4095,29 87 | :1347,5604,54 
18 -2995,3837,01 53 +1729,6986,29 88 | -1339,8382,35 
19 -2913,3650,74. 54 -1713,4584,06 89 *1332,2472,67 
20 -2837,7319,28 55 -1697,6671,73 90 -1324,7838,73 
21 -2767,6968,21 56 -1682,3046,16 91 -1317,4445,19 
22 -2702,6018,36 57 -1667,3516,87 92 -1310,2258,08 
23 -2641,8924,20 58 -1652,7905,00 93 -1303,1244,70 
24 -2585,0974,08 59 -1638,6042,44 94 +1296, 1373,58 
25 -2531,8135,69 60 -1624,7771,02 95 -1289,2614,44 
26 -2481,6935,12 61 -1611,2941,74 96 +1282,4938,07 
27 -2434,4361,24 62 ~ -1598,1414,12 97 *1275,8316,37 
28 -2389,7789,37 63 -1585,3055,58 98 | +1269,2722,22 
29 -2347,4919,77 64 -1572,7740,88 99 | -1262,8129,47 
30 -2307,3727,67 65 -1560,5351,59 100 -1256,4512,90 
31 -2269,2422,44 66 *1548,5775,63 101 +1250,1848,17 
32 +2232,9413,87 67 -1536,8906,87 102 | +1244,0111,78 
33 -2198,3284,24 68 -1525,4644,65 103 =| ~=—--1237,9281,04 
34. -2165,2764,98 69 -1514,2893,53 104 -1231,9334,00 
35 -2133,6717,06 70 -1503,3562,85 105 +1226,0249,49 
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Something corresponding to the above process must be used for values of ¢, 
with n > 105, 1f we require the function correct to ten or twelve figures. The 
usual approximate formulae for the factorial, or for the I’-function, do not converge 
rapidly enough and generally give the logarithm, so that they really involve the 
use of the logarithm tables to 14 places and the antilogarithm process. 


For values of 100 and upwards the following formula will be found good: 
For n even we deduce by Stirling’s Theorem from Eqn. (xcii) 


1:2533,1413,7315 25  -03125 = -039,0625 = -0102,5390,625 
_ 12555 ALD TS (255 1808S 080 002 Se ee 


n n n3 nt 


ee 
For example, n = 100 
Gioo = 1:2533,1413,7315 x -1002,5031,64165, 
-1256,4512,9017,89, 


which is correct to twelve places. 


For n odd we deduce by Stirling’s Theorem from Eqn. (xcu) by somewhat 
more lengthy algebra precisely the same value. 


[Owing to the growth of this memoir far beyond its original limits, it has 
been found impossible to include in this first portion the experimental work 
which accompanied the algebraical investigations, nor to give illustrations of the 
various uses which the tables serve. These matters are therefore reserved for a 
continuation of this memoir, which will appear later. | 
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APPENDIX. 


CORRELATION IN SMALL SAMPLES. 
TABLE A. Ordinates and Constants of Frequency Curves* 


3°2363 


n= 3 p variate (correlation in population sampled). 
0 “l 2 3 4 7) 6 off 8 9 
— 1:00 0 oo ao ea oo or) oo 20 o | 
— 965 1019-41 | 874-99 | 744-21 | 624-87) 515-25 | 413-97} 319-92 | 232-19 | 150-04] 72-82 | 
— 90 730-25 | 631-33 | 54026] 456-01 | 377-73 | 304-70 | 236-31 | 172-05 | 111-49 54-25 | 
- +85 604-25 | 526-19 | 453-08 | 384-48 | 319-97 | 259-17 | 201-74 | 147-37 95-78 | 46-73 | 
- -80 530-52 | 465-35 | 403-21 | 344-03 | 287-69 | 234-02 | 182-85 | 134-03 87:38 | 42-75 | 
ao) 481-24 | 425-22 | 370-79 | 318-13 | 267-34 | 218-43) 171:35 | 126-04 82-44) 40-45 | 
- 70 445-72 | 396-74 | 348-18 | 300-44 | 253-76 | 208-27 | 164:05 | 121-12 79-49 | 39-12 
— 65 418-87 | 375-59 | 331-78 | 287-95] 244-48 | 201-60 | 159-47 | 118-20 77:84 38-44 
- 60 397-89 | 359-43 | 319-60) 279-03 | 238-18 | 197-36 | 156-81 | 116-70 77:14 38-22 
- 65 381-13 | 346-87 | 310-50 | 272-73 | 234-08 | 194-94 | 155-60 | 116-29 77-18 | 38°38 | 
- 30 367-55 | 337-02 | 303-74 | 268-44 | 231-71 | 193-97 | 155-57 | 116-79 77:82 | 38-85 
- 45 356-44 | 329-29 | 298-82 | 265-77 | 230-74 | 194-20) 156-54 | 118-06 79-02 39-61 
- -40 347:30 | 323-28 | 295-41] 264-44 | 230-97 | 195-48 |) 158-40 | 120-05 80-72 40-63 
— +35 339-80 | 318-70 | 293-29 | 264-28) 232-26 | 197-72) 161-09 | 122-71 82:91 | 41-93 
— -30 333-68 | 315-35 | 292-30} 265-17 | 234-53 | 200-85 | 164-58 | 126-05 85:60 | 43-51 
— +20 328-75 | 313-08 | 292-30 | 267-01 | 237-71 | 204-86 | 168-86 | 130-08 88-82 45:38 
— -20 324-87 | 311-77 | 293-24 | 269-76 | 241-79 | 209-74 | 173-97 | 134-82 92-59 47-57 
- 15 321-95 | 311-36 | 295-05 | 273-39 | 246-77 | 215-51 | 179-94] 140-33 96-98 50-12 
= whi) 319-91 | 311-80 | 297-71 | 277-91 | 252-66 | 222-22 186-82 | 146-69 | 102-04 53-08 
05 318-71 | 313-06 | 301-21 | 283-32 | 259-51 | 229-93) 194-72 | 153-99 | 107-88 56-51 
00 318-31 | 315-13 | 305-58 | 289-66 | 267-38 | 238-73 | 203-72 | 162-34 | 114-59 60-48 | 
+ -05 318-71 | 318-02 | 310-83 | 296-99 | 276-35 | 248-73 | 213-97 | 171-89 | 122-33 65-09 
+ 10 319-91 | 321-75 | 317-02 | 305-38 | 286-51 | 260-05 | 225-63 | 182-84 | 131-27 70:48 
cee i) 321-95 | 326-39 | 324-22 | 314-94] 298-02 | 272-89 | 238-91 | 195-41 | 141-63 76-79 
+ -20 324-87 | 331-99 | 332-52 | 325-79 | 311-04 | 287-45 |) 254-08 | 209-89 |) 153-71 84-24 
+ +25 328-75 | 338-66 | 342-06 | 338-10 | 325-78 | 304-00 | 271-45 | 226-66 | 167-88 93-11 
+ -30 333-68 | 346-53 | 353-00 | 352-08 | 342-52 | 322-88 | 291-45 | 246-18 | 184-60 | 103-75 
+ 3d 339-80 | 355-76 | 365-56 | 368-00 | 361-58 | 344-53) 314-60 | 269-08 | 204-53 | 116-66 
+ -40 347-30 | 366-59 | 380-02 | 386-21 | 383-43 | 369-48 | 341-59 | 296-15 | 228-52 | 132-54 | 
+ 45 356-44 | 379-33 | 396-75 | 407-18 | 408-63 | 398-48 | 373-30) 328-48 | 257-73 | 152-34 
+ -d0 367-55 | 394-39 | 416-27) 431-53 | 437-95 | 432-48 | 410-96 | 367-53 | 293-80 | 177-48 
+ +d5 381-13 | 412:36 | 439-27] 460-12| 472-45 | 472-82] 456-23} 415-36 | 339-09 | 210-05 
+ -60 397-89 | 434-08 | 466-77 | 494-15 | 513-63 | 521-35 | 511-47 | 474-92 | 397-07 | 253-32 
+ -65 418-87 | 460-81 | 500-25 | 535-44 | 563-68 | 580-81 | 580-15 | 550-63 | 473-12) 312-61 
+ :70 445-72 | 494-50 | 542-05 | 586-77 | 626-01 | 655-44 | 667-63 | 649-38 | 575-90 | 396-99 
TO 481-24 | 538-43 | 596-05 | 652-77 | 706-23 | 752-17 | 782-72 | 782-63 | 720-25 | 523-20 
+ -80 530-52 | 598-63 | 669-27 | 741-91] 814-50 | 883-48 | 941-21 | 971-06 | 933-85 | 724-94 
+ 85 604-25 | 687-68 | 776-82 | 871-73 | 971-83 | 1074-99 | 1175-29 | 1256-88 | 1274-71 | 1079-64 
ocd, 730-25 | 838-25 | 956-77 | 1087-46 | 1232-06 | 1391-88 | 1566-09 | 1745-85 | 1890-70 | 1804-95 
+ 95 1019-41 | 1180-31 | 1361-48 | 1568-00 | 1806-93 | 2088-24 | 2426-12 | 2839-55 | 3341-78 | 3802-31 
+ 1:00 a a 2 oo a ee) ee) oo o | 
| = = aes | | 
Mean 0000 ‘0786 | +1579 2384 3209 4063 4960 5919 *6976 8204 
Antimode — 0000 | — -1513 | — -2755 | — -3679 | — -4354 | — -4855 | — -5239 | — -5539 | — -5779 | — -5976 
o ‘7071 “7045 6966 6829 6629 6351 5977 5470 4755 | 3635 | 
(1 - p*)/Vn-1 ‘7071 -7000 6788 6435 5940 5303 4525 3606 2546 | = 1344 | 
By “0000 0280 1152 2722 5206 9018 | 1-5002 | 2-5105 | 4-5031 | 10-2226 
Bo 15000 | 1-5323 | 1-6329 | 1-8142 |) 2-1011 | 2-5423 4:4114 | 6-7388 | 13-4672 


* Tn all cases the total frequency of the curve is taken as 1000. 
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TABLE A. Ordinates and Constants of Frequency Curves. 


m= 4. p variate (correlation in population sampled). 
| 
0 “1 2 3 4 3) 6 7 8 9 
— 1:00 500-00 | 386-33 | 295-47 222-28 | 163-11 | 115-34 77-06 46-93 24-03 8-02 
- 95 500-00 | 390-84 | 301-85 228-97 | 169-22 | 120-41 80-89 49-50 25-46 8-53 
— -90 500-00 | 395-43 | 308-42 235-94 | 175-66 | 125-79 84:99 52:27 27-01 9-09 
— 85 500-00 | 400-09 | 315-19 243-21 | 182-44 | 131-52 89-38 55:26 28-69 9-69 
— -&0 500-00 | 404:83 | 322-17 250-79 | 189-59 | 137-61 94-10 58-49 30-52 10-36 
— 75 500-00 | 409-64 | 329-36 258-71 | 197-13 | 144-10 99-16 61-99 32-51 11-08 
- “70 500-00 | 414-54 | 336-78 266-97 | 205-09 | 151-02 | 104-61 65:79 34:69 11-89 
- 65 500-00 | 419-52 | 344-44 275-61 | 213-51 | 158-41 | 110-48 69:92 37-08 12:77 
— +60 500-00 | 424-58 | 352-34 284-64 | 222-42 | 166-31 | 116-82 74-41 39-70 13-75 
— 30 500-00 | 429-73 | 360-49 294-09 | 231-85 | 174:77-| 123-68 79-32 42-58 14-83 
— +00 500-00 | 434-96 | 368-90 303-97 | 241-84] 183-84 | 131-10 84-68 45-76 16-04 
- 45 500-00 | 440-28 | 377-59 314:32 | 252-44 | 193-57 | 139-16 90-56 49-28 17-39 
— +40 500-00 | 445-69 | 386-57 325-17 | 263-69 | 204-03 | 147-92 97-03 53-19 18-90 
— 36 500-00 | 451-20 | 395-84 336:54 | 275-65 | 215-29 | 157-46] 104-15 57-54 20-59 
— +30 500-00 | 456-80 | 405-43 348-48 | 288-38 | 227-44 | 167-88] 112-01 62-40 22-51 
— 26 500-00 | 462-50 | 415-34 361-01 | 301-94 | 240-55 | 179-28 | 120-72 67-85 24-68 
— +20 500-00 | 468-30 | 425-59 374-17 | 316-40 | 254-74 | 191-79 | 130-41 73-98 27-16 
—- 15 500-00 | 474-19 | 436-20 388-01 | 331-84 | 270-11 | 205-53 | 141-20 80-91 29-99 
— -10 500-00 | 480-20 | 447-17 402-57 | 348-34 | 286-81 | 220-69 | 153-27 88-77 33°25 
-— 05 500-00 | 486:30 | 458-53 417-89 | 366-00 | 304-96 | 237-43 | 166-83 97-73 37:02 
00 500-00 | 492-52 | 470-30 434-04 | 384-94 | 324-76 | 256-00] 182-11} 108-00 41-41 
+ +05 500-00 | 498-85 | 482-49 451:07 | 405-26 | 346-38 | 276-64] 199-40) 119-83 46-56 
+ +10 500-00 | 505-29 | 495-13 469-04 | 427-09 | 370-06 | 299-67 | 219-05 | 133-53 52-64 
+ +15 500-00 | 511-84 | 508-23 488-02 | 450-60 | 396-04 | 325-45] 241-50 | 149-52 59-89 
+ 20 500-00 | 518-52 | 521-81 508-08 | 475-94 | 424-63 | 354-41 | 267-28 | 168-29 68-60 
+ +20 500-00 | 525-31 | 535-90 529-31 | 503-31 | 456-16 | 387-08 | 297-05 | 190-49 79:17 
+ +30 500-00 | 532-23 | 550-53 | 551-79 | 532-91 | 491-05 | 424-09 | 331-63 | 216-99 92-15 
+ 3d 500-00 | 539-28 | 565-72 575°62 | 564:99 | 529-75 | 466-20 | 372-07 | 248-90 108-29 
+ -40 500-00 | 546-46 | 581-49 600-89 | 599-81 | 572-82 | 514-35 | 419-70 | 287-74 128-65 
+ 45 500-00 | 553-77 | 597-89 627-74 | 637-68 | 620-92 | 569-71 | 476-24 | 335-53 154-74 
+ +30 500-00 | 561-22 | 614-93 656-28 | 678-96 | 674-81 | 633-70 | 543-96 | 395-13 188-83 
+ +00 500-00 | 568-81 | 632-65 686-66 | 724-05 | 735-44 | 708-15 | 625-86 | 470-54 | 234-34 
+ ‘60 500-00 | 576-54 | 651-09 719-03 | 773-41 | 803-91 | 795-33 | 725-97 | 567-57 296-69 
+ ‘65 500-00 | 584-41 | 670-29 753°55 | 827-57 | 881-57 | 898-20] 849-83] 694-88 | 384-78 
+ 70 500-00 | 592-44 | 690-28 790-43 | 887-15 | 970-08 | 1020-55 | 1005-20 | 865-74 | 514-05 
+ 795 500-00 | 600-62 | 711-11 829-86 | 952-87 | 1071-44 | 1167-38 | 1203-12 | 1101-28 | 712-86 
+ +80 500-00 | 608-96 | 732-82 872-07 | 1025-55 | 1188-16 | 1345-36 | 1459-79 | 1436-61 | 1037-65 
+ 85 | 500-00 | 617-46 | 755-47 917-33 | 1106-18 | 1323-35 | 1563-48 | 1799-62 | 1933-28 | 1612-67 
+ 90 | 500-00 | 626-13 | 779-10 965-92 | 1195-91 | 1480-93 | 1834-16 | 2260-62 | 2706-28 | 2751-97 
+ 95 500-00 | 634-97 | 803-76 | 1018-17 | 1296-08 | 1665-90 | 2174-80 | 2904-38 | 3989-04 | 5425-82 
+ 1:00 | 500-00 | 643-98 | 829-53 | 1074-43 | 1408-32 | 1884-66 | 2610-44 | 3835-42 | 6309-30 |13781-45 
Mean -0000 0850 -1705 2570 +3453 -4360 -5302 -6293 “7357 *8540 
Mode Non-'existent 
o | ‘S774. | +5746 ‘5661 5516 5304 5013 -4626 ‘4111 +3407 -2367 
(l- p2)/Vn-—1| -5774 | +5716 | -5543 5254 4850 | -4330 | -3695 | -2944 | -2078 -1097 
1 | -0000 ‘0314 +1295 -3069 5895 | 1:0283 | 1-7284 | 2:9402 | 5-4289 | 13-1840 
Bs | 1-8000 | 18399 | 1:9647 | 2-1907 | 2-5522 | 3-1163 | 4:0230 | 5:6093 | 8-9222 | 19-5710 
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TABLE A—(continued). 


(—) p variate (correlation in population sampled). 
0 7 2 | 3 4 5 6 | 7 8 9 
| a 
— 1:00 00 00 -00 | -00 00 00 00 | -00 | -00 00 
= ES 198-78 | 141-35 99-03 | 67-80 44-87 28-25 16-49 | 8-50 3-48 80 
= WY) 277-50 | 200-54 | 142-43 98-66 | 65:96) 41-90) 24:65 |) 12-79 | 5:27 1-22 
= 85 335:36 | 246-33 | 177-39 12435 84:01 | 53:86) 31-94 16-70 | 6-92 1-62 
= oe) 381-97 | 285-19 | 208-27 | 147-80 | 100-93 65-32 39-06 20-58 | 8-59 2-02 
= 8) 421-08 | 319-60 | 236-74 | 170-10 | 117-44 76-75 | 46-30 24-59 10-33 2-45 
=) 454-64 | 350-80 | 263-63 | 191-85 | 133-96 88-43 53°84 28-82 12-20 2-91 
— 66 483-79 | 379-53 | 289-41 | 213-36 150-72 | 100-55 61-79 33°37 14-24 3-42 
= -60 509-30 | 406-24 | 314-40 | 234-88 | 167-91) 113-24 70-28 38-30 16-48 3:99 
— 05 531-68 | 431-25 | 338-80 | 256-56 | 185-68 | 126-63 79-40 43-68 18-96 4-62 
— +30 551-33 | 454-76 | 362-75 | 278-52 | 204-14] 140-84 89-26 49-59 21-72 5:34 
— 45 568-52 | 476-92 | 386-34 | 300-86 | 223-41 | 156-00 99-98 56-12 24-82 6-16 
— -40 583-47 | 497-83 | 409-64 | 323-65 | 243-58 | 172-23 | 111-67 63-37 28-32 7-10 
— 33° 596-35 | 517-56 | 432-68 | 346-96 | 264-76 | 189-65 | 124-47 71-45 32-27 8-17 
= -30 607-30 | 536-16 | 455-50 | 370-82 | 287-04 | 208-40 | 138-53 80-49 36-78 9-41 
- 25 616-40 | 553-64 | 478-09 | 395-29 | 310-51] 228-62 | 154-02 90-65 41-92 10-86 
20. 623-76 | 570-02 | 500-44 | 420-38 | 335-28 | 250-49 | 171-14] 102-08 47-83 12-54 
= 15 629-42 | 585-27 | 522-52 | 446-12 | 361-44] 274-17] 190-10] 115-02 54-64 14-52 
= kD) 633-43 | 599-37 | 544-31 | 472-50 | 389-09 | 299-86] 211-15 | 129-71 62-54 16-86 
- 05 635-82 | 612-28 | 565-73 | 499-53 | 418-31) 327-76 | 234-60 | 146-44 71-74 19-65 
00 636-62 | 623-95 | 586-71 | 527-18 | 449-20 | 358-10 | 260-76 | 165-58 82-51 22-98 
+ 05 635-82 | 634:31 | 607-16 | 555-41 | 481-84] 391-13 | 290-03 | 187-57 95-19 27-01 
aD) 633-43 | 643-27 | 626-97 | 584-15 | 516-30! 427-11 | 322-86 | 212-91] 110-22 31-91 
ap ul) 629-42 | 650-74 | 645-99 | 613-31 | 552-65 | 466-33 | 359-76 | 242-26 | 128-14 37:93 
20 623-76 | 656-59 | 664-06 | 642-76 | 590-91 | 509-09 | 401-33 | 276-40 | 149-65 45-39 
+ -25 616-40 | 660-68 | 680-97 | 672-33 | 631-08 | 555-72 | 448-27] 316-29 | 175-68 54-74 
+ +30 607-30 | 662-86 | 696-46 | 701-77 | 673-11 | 606-51 | 501-37 | 363-13 | 207-42 66-60 
+ +35 596-35 | 662-91 | 710-24 | 730-79 716-87 | 661-77 | 561-55 | 418-41 | 246-48 81-84 
+ -40 583-47 | 660-60 | 721-95 | 758-98 | 762-14 | 721-75 | 629-83 | 483-98 | 295-00 | 101-75 
+ 45 568-52 | 655-66 | 731-14 | 785-81 808-54 | 786-62 | 707-37 | 562-17 | 355-92 | 128-18 
+ :30 551-33 | 647-74 | 737-25 | 810-60 | 855-49 | 856-39 | 795-39 | 655-88 | 433-30 | 163-98 
+ -d5 531-68 | 636-41 | 739-59 | 832-42 | 902-10) 930-77 | 895-14 | 768-74 | 532-81 | 213-57 
+ -60 509-30 | 621-15 | 737-29 | 850-06 | 947-05 | 1009-03 | 1007-71 | 905-20 | 662-54 | 284-13 
+ 65 483-79 | 601-26 | 729-21 | 861-85 | 988-36 | 1089-62 | 1133-68 | 1070-61 | 834-18 | 387-69 
+ -70 454-64 | 575-83 | 713-82 | 865-50 | 1023-03 | 1169-66 | 1272-51 | 1270-96 | 1064-70 | 545-45 
+ °75 421-08 | 543-59 | 689-01 | 857-73 | 1046-50 | 1243-92 | 1421-02 | 1511-86 | 1378-85 | 796-89 
+ -80 381-97 | 502-63 | 651-67 | 833-60 | 1051-45 | 1302-93 | 1570-47 | 1795-27 | 1811-55 | 1220-21 
th Hef) 335°36 | 449-87 | 596-86 | 785-20 | 1025-58 | 1328-96 | 1699-62 | 2109-71 | 2406-01 | 1981-91 
+ -90 277-50 | 379-52 | 515-46 | 698-04 | 945-84 | 1286-02 | 1756-53 | 2398-74 | 3182-44 | 3457-80 
+ -95 198-78 | 277-21 | 385-59 | 538-03 | 757-76 | 1085-21 | 1595-22 | 2436-67 | 3917-02 | 6392-40 
+ 1:00 00 ‘00 00 | 00 00 -00 | 00 00 ‘00 | ‘00 
Mean -0000 0884 1773 2671 3584 4517.) +5480 -6482 “7541 8687 
Mode 0000 3264 5520 | -6936 7863 8500 8965 | -9318 | -9595 | -9817 
o 5000 4972 4886 | -4740 | -4528 4239 3858 3358 | -2691 | -1748 
(1 — p*)/Vn—1]| -5000 -4950 -4800 4550 -4200 3750 -3200 | +2550 -1800 0950 
By -0000 0315 | -1299 3077 | = -5909 | 1-0315 | 1:7297 | 2:9374 | 5-4065 | 13-0290 
2 20000 | 2:0429 | 2-1769 | 2-4201 | 2-8097 | 3:-4191 | 44027 | 6-1333 | 9-7830 | 21-7579 
| | 
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TABLE A. Ordinates and Constants of Frequency Curves. 


0% p variate (correlation in population sampled). 
0 | 2 3 4 7) 6 7 S 9 
— 1:00 00 | 00 00 00 00 00 00 -00 00. -00 
— 96 73-:125 | 47-28 30:03 | 18-55 10-99 6-12 3-10 1-35 “44 07 
= | 142-500 | 94-07 60:81 38-13 22-88 12-89 6-60 2-89 95 15 
— +85 | 208-125 | 140-28 92-30 58-76 35°74 20-37 10-54 4-66 1-54 -25 
- +80 | 270-000 | 185°83 124-49 80-50 49-64 28-64 14:97 6-68 2-23 36 
- 75 328-125 | 230-63 | 157-34 | 103-38 64-65 37:76 19-96 9-00 3-03 -50 
- -70 | 382-500 | 274-58 | 190-82 | 127-43 80-85 47-84 25-59 11-66 3°96 -66 
— 65 433-125 | 317-59 | 224-87 | 152-69 98-33 58-96 31-92 14-71 5:05 85 
- -60 | 480-000 | 359-54 | 259-44 | 179-18 | 117-16 71:24 39-06 18-20 6-32 1-07 
- 06 523-125 | 400-33 | 294-48 | 206-94 | 137-45 84-78 47-09 22-22 7:79 1-33 
- +00 | 562-500 | 439-82 | 329-89 | 235-97 159-29 99-73 56-16 26-83 9-52 1-64 
— 45 598-125 | 477-89 | 365-60 | 266-29 | 182-79 | 116-21 66-38 32-13 11-55 2-02 
— -40 630-000 | 514-41 | 401-49 | 297-90 | 208-03 | 134-39 77-92 38-25 13-93 2-46 
— 3d 658-125 | 549-23 | 437-46 | 330-80 | 235-13 | 154-44 90-95 45:31 16-73 3-00 
— +30 682-500 | 582-20 | 473-37 | 364-95 | 264-20 | 176-55 | 105-68 53°47 20-03 3-64 
— +26 703-125 | 613-16 | 509-07 | 400-33 | 295-32 | 200-93 | 122-35 62-92 23-94 4-41 
— +20 720-000 | 641-93 | 544:37 | 436-88 | 328-62 | 227-80 | 141-22 73-89 28-59 5:35 
- +165 | 733-125 | 668-33 | 579-08 | 474-49 | 364-17| 257-41 | 162-61 86-65 34-12 6-50 
- 10 | 742-500 | 692-18 | 612-97 | 513-07 | 402-05 | 290-01 | 186-88 | 101-53 40-75 7-91 
ee) 748-125 | 713-27 | 645-79 | 552-46 | 442-32 | 325-89 | 214-44] 118-92 48-71 9-65 
00 | 750-000 | 731-39 | 677-23 | 592-47 | 485-02 | 365-35 | 245-76 | 139-31 58-32 11-80 
+ 05 | 748-125 | 746-31 | 706-99 | 632-84 | 530-14 | 408-70 | 281-39 | 163-28 69-98 14-50 
+ -10 742-500 | 757-78 | 734-67 | 673-26 | 577-63 | 456-23 | 321-94 | 191-55 84-21 17-90 
+ -L6 733-125 | 765-56 | 759-87 | 713-34 | 627-35 | 508-26 | 368-14 | 224-98 | 101-67 22-24 
+ -20 720-000 | 769-38 | 782-10 | 752-61 | 679-10 | 565-04 | 420-76 | 264-66 | 123-24 27-81 
+ +25 703-125 | 768-94 | 800-84 | 790-47 | 732-53 | 626-81 | 480-72 | 311-90 | 150-07 35:06 
+ +30 682-500 | 763-96 | 815-48 | 826-21 | 787-15 | 693-69 | 548-97 | 368-33 | 183-69 44-60 
+ 3d 658-125 | 754-11 | 825-36 | 858-93 | 842-24 | 765-62 | 626-56 | 435-94 | 226-18 57-33 
+ -40 630-000 | 739-06 | 829-70 | 887-57 | 896-80 | 842:35 | 714-54 | 517-20 | 280-35 74-61 
+ 45 598-125 | 718-45 | 827-66 | 910-82 | 949-48 | 923-20] 813-87 | 615-10 | 350-06 98-48 
+ +30 562-500 | 691-90 | 818-27 | 927-11 | 998-43 | 1006-98 | 925-29 | 733-19 | 440-67 | 132-11 
+ +05 523-125 | 659-02 | 800-46 | 934-53 | 1041-18 | 1091-61 | 1048-92 | 875-62 | 559-69 | 180-65 
+ -60 480-000 | 619-39 | 772-99 | 930-76 | 1074-42 | 1173-81 | 1183-82 | 1046-92 | 717-69 | 252-62 
+ +65 433-125 | 572-56 | 734-52 | 913-01 | 1093-74 | 1248-41 | 1326-99 | 1251-36 | 929-54 | 362-77 
+ -70 | 382-500 | 518-05 | 683-50 | 877-89 | 1093-27 | 1307-53 | 1471-73 | 1491-35 | 1215-82 | 537-74 
+ +75 | 828-125 | 455-38 618-19 | 821-29 | 1065-22 | 1339-14 | 1604-80 | 1763-58 | 1603-56 | 828-01 
+ +80 | 270-000 | 384-02 | 536-66 | 738-25 | 999-24 | 1325-11 | 1701-20 | 2050-08 | 2122-58 | 1334-30 
+ +85 208-125 | 303-41 | 436-70 | 622-74 | 881-50 | 1237-97 | 1714-92 | 2297-17 | 2783-31 | 2265-98 
90 142-500 | 212-95 | 315-86 | 467-45 | 693-60 | 1036-09 | 1561-73 | 2364-83 | 3479-97 | 4043-96 
+ 95 73-125 | 112-04 | 171-33 | 263-48 | 410-83) 655-98 | 1086-58 | 1899-93 | 3578-10 | 7013-72 
+ 1:00 -00 00 | -00 -00 -00 -00 | -00 -00 -00 -00 
3 ae on 

Mean “0000 -0906 -1816 2734 3665 4614 5587 6594 -7646 8766 
Mode | 0000 -2197 4101 5599 ‘6747 -7630 8321 8870 9319 9689 
a | +4472 4444 -4360 -4216 -4007 3725 3356 2878 2253 -1397 
(1 -— p2)\/Vn—1| -4472 4427 -4293 -4070 3757 3354 2862 2281 -1610 -0850 
By “0000 “0304 1251 2959 -5667 -9838 | 16418 | 2-7599 | 4-9848 | 11-4554 
Bo 2-1429 | 2-1863 | 2-3222 | 2-5682 2:9615 | 3:5746 | 4:5585 | 6-2752 | 9-8417 | 21-1653 


r variate (correlation in sample). 


H. E. Soper, A. W. Youne, B. M. Cave, A. Lez, K. PEARSON 383 
TABLE A—(continued). 
n= p variate (correlation in population sampled). 
0 1 2 3 4 | 5 6 | 7% “8 Y) 
| | | 
— 1:00 00 00 00 -00 00 ‘00 | 00 | 00 00 -00 
- 95 25-84 15-19 8-75 4:87 2-58 1-27 56 20 “05 ‘Ol 
- 90 70-30 42-38 24-93 14-14 7-62 | 3°81 1-69 63 16 | “02 
-— +85 124-08 76:72 46-12 26-66 14-60 | 7:39 3°34 1-25 eoo) 04 
— +80 183-35 | 116-30 71-46 42-10 23-44 12-05 a51 2-08 | “56 | -06 
- 75 245-63 | 159-85 | 100-42 60-33 34:17 17-84 8:26 3°16 85 | 10 
- 70 309-15 | 206-44 | 132-64 81-28 46-85 24-84 11-67 4-53 1-24 | 14 
— 65 372-52 | 255-27 | 167-80 104-93 61-59 | 33-20 | 15-83 | 6-22 Wei 2.)| -20 | 
- -60 434-60 | 305-66 | 205-61 131-26 78-49 43:03 | 20-83 | 8-30 2-32 | -27 | 
— -bd 494-46 | 356-96 | 245-82 160-28 97:70 | 54:50 26-81 | = 10-85 3-08 | 37 
- +300 551-33 | 408-59 | 288-14 191-98 | 119-36 67-80 33:92 | 13-93 4-01 -49 | 
— 45 604:53 | 459-99 | 332-29 | 226-34) 143-61 83-12 42-31 | 17-66 5-16 63 
— -40 653-49 | 510-59 | 377-96 | 263-34 | 170-62 | 100-70 52-20 22-16 6-58 82 
- 3d 697-73 | 559-87 | 424-83 | 302-91) 200-54 | 120-78 63-81] 27-58 8-32 1-05 
- +30 736-85 | 607-28 | 472-53 | 344-98 | 233-54 | 143-64 77-42 | 34-10 10-48 | 1-35 
— 25 770-51 | 652-32 | 520-67 | 389-43 | 269-77 | 169-60 93:33 41-94 13-13 1-72 
— -:20 798-41 | 694-44 | 568-81 436-10 | 309-36 | 198-97) 111-92) 51-36) 16-41 2-19 
- +15 820-34 | 733-15 | 616-47 | 484-78 | 352-43 | 232-12] 133-60| 62-69| 20-47 2-80 | 
- 10 836-13 | 767-91 | 663-12 | 535-18 | 399-06 | 269-42 | 158-87 76:33 | 25-50 3°56 
- 05 845-65 | 798-24 | 708-17 586-95 | 449-29 | 311-28 | 188-29 92:77 31-77 4-55 
-00 848-83 | 823-62 | 750-99 | 639-65 | 503-10 | 358-10 | 222-51 | 112-60 39-60 5-82 
+ 05 845-65 | 843-56 | 790-87 | 692-72 | 560-37 | 410-29 | 26229] 136-56 49-43 7:48 
+ +10 836-13 | 857-59 | 827-06 745-49 | 620-88 | 468-23 | 308-46) 165-58 61-82 9-65 
+ +15 820-34 | 865-26 | 858-73 797-15 | 684-25 | 532-27 | 361-98 | 200-77 77-52 | 12-53 
+ -20 798-41 | 866-13 | 884-99 846-70 | 749-91 | 602-63 | 423-92 | 243-54 97:54 16-38 
+ +25 770-51 | 859-80 | 904-89 | 893-00 | 817-07 | 679-42 | 495-43 | 295-61] 123-21 21-58 
+ -30 736-85 | 845-93 | 917-44 934-67 | 884-59 | 762-49 | 577-72 359-10 | 156-38 | i38rAil 
+ +35 697-73 | 824-21 | 921-57 | 970-10} 950-96 | 851-33) 671-98 436-64) 19953) 38-62 
+ -40 653-49 | 794-40 | 916-22 | 997-43 | 1014-18 | 944-93 | 779-26 531-38) 256-18 | 52-62 
+ +45 604-53 | 756:38 | 900-29 | 1014-55 | 1071-64 | 1041-52 | 900-25 | 647-10 331-09 | 72-76 
+ -d0 551-33 | 710-11 | 872-71 | 1019-07 | 1120-03 | 1138-25 | 1034-91 788-14 | 431-03 | 102-38 
+ +08 494-46 | 655-69 | 832-49 | 1008-34 | 1155-12 | 1230-82 1181-85 | 959-18 | 565-53 | 147-00 
+ 60 434-60 | 593-44 | 778-80 | 979-51 | 1171-74 | 1312-87 1337-35 | 1164-60 | 747-90 | 216-12 
+ 65 372-52 | 523-87 | 711-02 929-65 | 1163-57 | 1375-33 | 1493-80 | 1406-95 | 996-60 | 326-69 
+ -70 309-15 | 447-82 | 628-96 855-90 | 1123-23 1405-52 1637-17 | 1683-53 | 1336-03 | 510-28 
+ +75 245-63 | 366-56 | 533-05 755-92 | 1042-49 | 1386-41 | 1743-33 | 1979-38 | 1794-82 | 828-27 
+ -80 183-35 | 281-92 | 424-74 | 628-49 | 913-07 | 1296-13 | 1772-80 | 2252-77 | 2393-94 | 1404-91 
+ +85 124-08 | 196-62 | 307-09 | 474-80 | 728-55 | 1109-20 1664-80 | 2407-27 3099-80 | 2495-12 
+ 90 70:30 | 114-82 | 186-02 | 300-94 | 489-11 802-95 | 1336-07 | 2244-07 | 3664-14 | 4555-85 
+ 95 25:84 43-51 13-17 124:05 | 214-20 | 381-46 712-23 | 1426-14 | 3147-79 | 7414-56 
+ 1:00 00 00 00 -00 00 | -00 00 -00 | 00 -00 
——|— = oF | Shales Oe 
| | | 
Mean ‘0000 | -0921 | -1845 Plea 3720 4678 | -5658 | -6667 | -7713 | -8814 
Mode “0000 ‘1813-3484 “4919 -6106 -7087 | -7894 | -8563 | -9122 9595 
o -4082 | -4055 | -3973 | -3832 | -3629 | -3356 | -3001 | -2545 | -1958 | -1175 
(1 - p2)(Vn -—1 -4082 4042 3919 3715 3429 3062 | 2613 2082 “1470 ‘0776 
1 “0000 0288 -1186 +2798 5340 9222 | 15263 | 2-5318 | 4-4611 9-6408 
Bs 2:2500 | 2-2929 | 2-4267 | 2-6686 | 3-0535 | 3-6503 | 4-5968 | 6-2238 | 9-5129 | 19-3424 
| es 


y variate (correlation in sample). 


384 Distribution of Correlation Coefficient in Small Sanples 


TABLE A. Ordinates and Constants of Frequency Curves. 


pis), p variate (correlation in population sampled). 
0 “1 “2 “3 “4 5) 6 7 8 of) 

— 1:00 00 00 | 00 -00 | 00 -00 00 00 00 00 
-— 965 8-91 4:76 | 2-49 1-25 | 59 26 10 03 -O1 -00 
— 90 33-84 18-63 | 9-97 5-12 | 2:47 1-10 42 13 -03 00 
- +85 72-19 40-95 22-48 11-80 | 5-82 2-62 1-03 33 07 ‘Ol 
— +80 121-50 71-02 40-02 21-48 | 10-80 4-95 1-98 63 13 ‘O01 
- 75 179-44 | 108-11 62-53 34:34 | 17-61 8-22 3°33 1-08 23 02 
- -70 243-84 | 151-45 89-96 50-58 | 26-48 12-58 5-19 1-71 38 03 
- 66 312-66 | 200-21 122-17 70-35 37-63 18-23 7:66 2-57 ‘57 05 
-— -60 384-00 | 253-56 158-99 93-82 51-30 25-35 10-84 3-69 83 07 
— 38 456-10 | 310-59 | 200-22} 121-13 | 67-75 34:18 14-89 5-16 1-18 10 
— 00 527-34 | 370-40 | 245-57 | 152-40! 87-25 44-96 19-98 7-06 1-64 “14 
— 45 596-26 | 432-05 | 294-69 | 187-72 | 110-08 58-00 26-31 9-47 2-25 19 
— -40 661-50 | 494-55 | 347-19 | 227-13 | 136-53 73-61 34:12 12-53 3-03 27 
-— 35 721-88 | 556-91 402-57 | 270-64 166-89 92-15 43-68 16-38 4-04 36 
- +30 776-34 | 618-14 | 460-28} 318-19 201-43 | 114-03 55-33 21-22 5-34 -49 
— +28 823-97 | 677-22 | 519-66 | 369-65 | 240-45) 139-67 69-47 27-27 7-03 -66 
— -20 864-00 | 733-11 579-99 | 424-79 | 284-17] 169-57 86-55 34:83 9-19 -88 
- 15 895-79 | 784-83 640-42 | 483-31 | 332-81 | 204-24 | 107-11 44-26 11-98 1-17 
- 10 918-84 | 831-37 700-04 | 544:75 | 386-52 | 244-24 | 131-79 56-00 15-57 1-57 
- 05 932-82 | 871-77 757-84 | 608-54 | 445-36 | 290-13 | 161-34 70-61 20-22 2-09 
-00 937-50 | 905-10 | 812-68 | 673-93) 509-27 | 342-52 | 196-61 88-81 26-24 2-80 
+ 05 932-82 | 930-49 | 863-38 | 740-00 | 578-06 | 401-96 238-59 | 111-47 34-07 3°77 
+ +10 918-84 | 947-15 | 968-63 | 805-60 651-31 | 468-99 | 288-43 | 139-69 44-29 5-08 
+ +16 895-79 | 954-37 947-08 | 869-36 | 728-36 | 544-03 | 347-39 | 174-87 57-69 6-89 
+ 20 864-00 | 951-55 | 977-31 | 929-66) 808-23 | 627-31 | 416-87 | 218-74 75-35 9-42 
+ +25 823-97 | 938-24 | 997-87 | 984-60) 889-50] 718-81 | 498-39 | 273-48 98-76 12:97 
+ -30 776-34 | 914-13 | 1007-32 | 1031-99 | 970-27 | 818-08 | 593-47 | 341-77 | 129-97 18-05 
+ +35 721-88 | 879-12 | 1004-28 | 1069-38 | 1048-02 | 924-04 | 703-52 | 426-94] 171-88 25-40 
+ -40 661-50 | 833-34 | 987-47 | 1094-05 | 1119-51 | 1034-75 | 829-63 | 532-99 | 228-55 36-23 
+ 45 596-26 | 777-15 | 955-78 | 1103-06 | 1180-66 | 1147-03 |. 972-16 | 664-66 | 305-76 52-50 
+ +50 527-34 | 711-26 | 908-44 | 1093-37 | 1226-48 | 1256-06 | 1130-10 | 827-21 | 411-69 77-48 
+ +35 456-10 | 636-70 | 845-06 | 1061-99 | 1251-03 | 1354-86 | 1300-15 | 1025-98 | 558-02 | 116-83 
+ -60 384:00 | 554-91 765-85 | 1006-21 | 1247-49 | 1433-64 | 1475-18 | 1265-10 | 761-16} 180-59 
+ -65 312-66 | 467-80 | 671-79 | 924-01 | 1208-46 | 1479-32 | 1642-00 | 1544-83 | 1043-60 | 287-38 
+ -70 243-84 | 377-81 564-92 | 814-59 | 1126-64 | 1475-21 | 1778-44 | 1856-10 | 1434-02 | 473-04 
+ +75 179-44 | 287-97 448-65 | 679-20 | 996-09 | 1401-54 | 1849-45 | 2169-84 | 1962-40 | 809-47 
+ +80 121-50 | 202-00 | 328-13 | 522-33 | 814-61 | 1237-97 | 1804-24 | 2418-01 | 2637-69 | 1445-38 
+ 85 72-19 | 124:36 | 210-79] 353-40 | 587-92 | 970-50 | 1578-45 | 2464-22 | 3372-91 | 2684-78 
+ 90 33°84 60-42 106-94 | 189-15 | 336-77 | 607-69 | 1116-43 | 2080-31 | 3769-68 | 5016-00 
+ 95 8-91 16-49 30-50 57:02 | 109-05 | 216-63 | 456-02 | 1045-85 | 2706-04 | 7661-16 
+ 1:00 00 -00 -00 00 -00 00 -00 -00 00 -00 
Mean -0000 0932 -1867 -2808 | -8759 4725 -5709- -6718 7760 *8847 
Mode -0000 1613 +3135 4505 5698 6722 “7594 8338 8975 +9524 
o -3780 +3753 3673 +3536 +3339 3074 2733 +2299 1746 -1024 
(1 - p2)/Vn—1| -3780 -3742 3628 3439 | -3175 2835 -2419 +1928 “1361 -0718 
By -0000 0272 “1117 +2630 -5000 *8586 | 1-4088 | 2:3044 | 3-9581 8-0434 
Bo 2-3333 | 2:3751 | 2-5051 | 2-7393 | 83-1101 | 3-6800 | 4:5751 | 6-0839 | 9-0368 | 17-2512 


y variate (correlation in sample). 


H. E. Soper, A. W. Youne, B. M. Cavz, A. Lez, K. Pearson 385 


TABLE A—(continued). 


u— 9 p variate (correlation in population sampled). 

0 “1 2 3 4 7) 6 7 8 9 
— 1:00 00 -00 00 00 00 00 00 -00 00 00 

- 95 3-02 1-47 69 31 13 05 02 00 -00 = 

- :90 16-03 8-06 3-92 1-82 ‘79 31 10 03 00 = 

- 85 41-32 21-50 10-78 5:14 2-28 =O 31 08 ‘01 = 

-— -80 79-21 42-66 22-04 10-78 4-89 2-00 ‘70 19 -03 = 
- 75 128-96 71:93 38-30 19-23 8-93 3-72 1-32 37 -06 -00 
- 70 189-20 | 109-29 60-01 30-95 14-72 6-27 2:27 “64 UL 01 
- 65 258-15 | 154-47 87-49 46-39 22-61 9-85 3-64 1-04 19 ‘01 
- :60 333-77 | 206-90 | 120-93 65-95 32-98 14-69 5:55 1-62 29 02 
- 06 413-87 | 265-84 | 160-41 90-03 46-21 21-08 8-14 2-42 45 -03 
- -d0 496-20 | 330-31 | 205-86) 118-99 62-73 29-33 11-58 3-52 66 “04 
— 45 578-53 | 399-18 | 257-08 | 153-13 83-00 39-81 16-09 4-99 96 06 
- -40 658-72 | 471-20} 313-71 | 192-70} 107-46 52-93 21-93 6-97 1-37 -09 
- +35 734-71 | 544-95 | 375-25 | 237-86 | 136-60 69-16 |. 29-41 9-57 1-93 12 
- +30 804-64 | 618-94 | 441-03 | 288-69 | 170-90 89-03 | 38-90 12-99 2-68 Il 
— +25 866-82 | 691-61 | 510-20] 345-14 | 210-81 | 113-15 50-86 17-45 3-70 *25 
— +20 919-77 | 761-33 | 581-74 | 407-03 | 256-77 | 142-16 65-83 23-24 5-06 “35 
- +15 962-26 | 826-48 654-46 | 473-98 | 309-16] 176-78 | 84-46 30-73 6-89 48 
10 993-32 | 885-42 | 726-99] 545-46 | 368-27} 217-80) 107-54 40-41 9-36 68 
- 05 1012-24 | 936-57 | 797-78 | 620-65 | 434-27 | 266-02 | 135-99 52-88 12-66 95 
-00 1018-59 | 978-46 | 865-14 | 698-50 | 507-13 | 322-29} 170-89 68-91 17-11 1:33 
+ 05 1012-24 | 1009-68 | 927-20} 777-65 | 586-61 | 387-41) 213-52 89-51 23-11 1-86 
+ -10 993-32 | 1029-05 | 982-02 | 856-41 | 672-14} 462-13) 265-33 | 115-94 31-22 2-63 
+ +15 962-26 | 1035-54 | 1027-55 | 932-73 | 762-75 | 547-03 | 327-98 | 149-85 42-24 3°73 
+ :20 919-77 | 1028-41 | 1061-74 | 1004-19 | 856-97 | 642-43 | 403-32 | 193-30 57-27 5-33 
+ -25 866-82 | 1007-20 | 1082-54 | 1068-00 | 952-68 | 748-20 | 493-28 | 248-93 77-88 7-67 
+ +30 804-64 | 971-79 | 1088-08 | 1121-0] | 1047-05 | 863-56 | 599-83 | 320-04 | 106-28 11-16 
+ +35 734-71 | 922-48 | 1076-68 | 1159-76 | 1136-34 | 986-80 | 724-70] 410-75 | 145-67 16-43 
+ -40 658-72.| 859-99 | 1047-01 | 1180-62 | 1215-85 | 1114-86 | 869-09 | 526-05 | 200-65 24-55 
+ +45 578-53 | 785-54 | 998-27 | 1179-91 | 1279-81 | 1242-93 | 1032-99 | 671-79 | 277-86 37-28 
+ -50 496-20 | 700-85 | 930-34 | 1154-15 | 1321-44 | 1363-83 | 1214-32 | 854-38 | 386-97 57-71 
+ +65 413-87 | 608-22 | 843-95 | 1100-46 | 1333-12 | 1467-50 | 1407-47 | 1079-97 | 541-89 91-38 
+ -60 333-77 | 510-46 | 740-94 | 1016-99 | 1306-81 | 1540-47 | 1601-27 | 1352-44 | 762-41 | 148-52 
+ 65 258-15 | 410-96 | 624-47 | 903-63 | 1234-97 | 1565-77 | 1776-21 | 1669-36 | 1075-58 | 248-83 
+ 70 189-20 | 313-58 | 499-21 | 762-81 | 1111-97 | 1523-66 | 1901-22 | 2014-01 | 1515-00 | 431-66 
+ 75 128-96 | 222-57 | 371-51 | 600-46 | 936-53 | 1394-27 | 1930-95 | 2341-13 | 2111-97 | 778-77 
+ +80 79-21 | 142-39) 249-41 | 427-14] 715-16 | 1163-62 1807-20 | 2554-55 | 2860-84 | 1463-91 
+ +85 41-32 77°38 | 142-36 | 258°83 | 466-86 | 835-66 | 1472-97 | 2482-96 | 3612-89 | 2844-14 
+ -90 16-03 31-28 60-49 | 116-98 | 228-19) 452-62) 918-20 | 1898-32 | 3818-02 | 5437-47 
+ +95 3-02 6-15 12-51 25-79 54:64 | 121-08 | 287-39 | 755-00 | 2290-26 | 7794-37 
+ 1:00 -00 -00 00 ‘00 00 00 | 00 -00 00 -00 
Mean -0000 0940 -1883 +2832 3789 -4760 ‘ST47 6756 -7793 -8869 
Mode -0000 +1492 2920 4238 +5420 6463 | -7374 8168 8861 9467 
o +3536 3510 +3431 3298 3107 2852 | +2524 2109 +1586 0915 
(1 - p2)/Vn—1| +3536 | -3500 | -3394 | -3217 | -2670 | -2652 | -2263 | -1803 | +1273 | -0672 
1 -0000 0256 “1051 -2468 4677 ‘7983 | 1-2989 | 2-0963 | 3-5152 | 6-7561 
Bo 2-4000 | 2:4403 | 2-5657 3°1456 | 3-6857 | 4:5239 | 5-9099 | 8-5326 | 15-3136 


2-7907 


386 Distribution of Correlation Coefficient in Small Samples 


TABLE A. Ordinates and Constants of Frequency Curves. 


r variate (correlation in sample). 


n= 10: p variate (correlation in population sampled). 
| 0 >is ao 3 “4 zo) 6 7 S 9 
— 1:00 | 00 00 -00 -00 00 -00 -00 00 00 -00 
a Wy) 1-01 “45 19 08 03 ‘O1 -00 00 = — 
— -90 7:50 O44 | 1-53 “64 25 09 03 01 = — 
- +85 23-37 11-15 oll 2-21 88 “31 09 -02 -00 = 
— -80 | 51-03 25:33 | 12-00 O34 2-19 80 “24 -06 ‘01 — 
— 7d Nee Olo9 47-29 23-18 10-64 4:47 1-67 “52 “12 02 == 
- 70 145-09 77:94 39-56 18-72 8-09 3°09 98 23 03 = 
— 65 210-66 | 117-77 61-92 30-23 13-43 5-25 1-71 “42 06 = 
— 60 286-72 | 166-85 | 90-90 45-82 20-95 8-41 2°81 ‘70 “10 -00 
- 3d 371-15 | 224-86 | 127-01 66-18 31-14 12-85 4-39 1-12 7 ‘01 
— +b0 | 461-43 | 291-10 | 170-55 91-82 44-57 18-91 6-63 1-73 26 01 
— -45 | 554-77 | 364-50 | 221-63 | 123-45 61-84 27-00 9-73 2-60 “41 -02 
- 40 648-27 | 443-69 | 280-14 | 161-56 83-58 37:60 13-93 3°83 ‘61 03 
— dd 739-03 | 526-99 | 345-68 | 206-59 | 110-50 51-29 19-56 5-52 “91 04 
— +30 824-22 | 612-48 | 417-62] 258-84 | 143-28 68-70 27-02 7:85 1:33 06 
-— +25 901-22 | 698-04 | 495-03 | 318-47 | 182-65 90-58 36-79 11-03 1-92 -09 
— +20 967-68 | 781-38 | 576-65 | 385-42 | 229-29] 117-77 49-48 15-32 2-76 13 
- 5 1021-57 | 860-14 | 660-97 | 459-39 | 283-82 | 151-22 65-82 21-09 3-92 -20 
— 10 1061-26 | 931-94 746-13) 539-76 | 346-77 | 191-94 86-73 28-82 5:55 -29 
~ 05 | 1085-57 | 994-42 | 830-01 | 625-59 | 418-49 | 241-06 113-28 39-13 7-83 “42 
00 | 1093-75 | 1045-39 910-20 | 715-49 | 499-09 | 299-70 146-80 52-84 11-02 62 
+ 05 1085-57 | 1082-81 | 984-11 | 807-66 | 588-33 | 369-02 | 188-84 71-03 15-49 “91 
+ 10 1061-26 | 1104-96 | 1048-94 | 899-79 | 685-53 | 450-06 | 241-23 95-11 21-75 1:35 
+ +15 | 1021-57 | 1110-49 | 1101-85 | 989-04 | 789-44] 543-65 | 306-06 | 126-91 30-57 1-99 
+ -20 | 967-68 | 1098-49 1139-99 | 1072-04 | 898-05 | 650-25 | 385-66 | 168-83 43-03 2-98 
+ +25 901-22 | 1068-59 | 1160-70 | 1144-96 1008-47 | 769-73 | 482-55 | 223-95 60-70 4-48 
+ +30 824-22 | 1021-01 | 1161-60 | 1203-51 | 1116-77 | 900-98 | 599-22] 296-23 85-91 6-82 
+ +35 739-03 | 956-66 | 1140-83 | 1243-14 1217-79 | 1041-59 | 737-88 | 390-62 | 122-04 10-51 
+ -40 648-27 | 877-13 | 1097-21 | 1259-23 | 1305-16 | 1187-27 | 899-90 | 513-21 | 174-12 16-44 
+ +45 554:77 | 784-74 | 1030-51 | 1247-45 1371-20 | 1331-27 | 1084-96 | 671-18 | 249-62 26-17 
+ -d0 461-43 | 682-54 | 941-67 | 1204-17 1407-27 | 1463-74 | 1289-77 | 872-31 | 359-56 42-50 
+ dd 371-15 | 574-23 | 833-02 | 1127-09 | 1404-16 | 1571-18 | 1506-11 | 1123-76 | 520-21 70-66 
+ -60 286-72 | 464-10 | 708-50 | 1015-97 1353-14 | 1636-20 | 1718-18 | 1429-27 | 754-96 | 120-77 
+ 65 210-66 | 356-82 | 573-73 | 873-46 | 1247-48 | 1638-20 | 1899-37 | 1783-34 | 1095-94 | 213-01 
a oD) 145-09 | 257-23) 436-02 | 706-04 | 1084-83 | 1555-63 | 2009-23 | 2160-46 | 1582-39 | 389-45 
+ +70 | 91:59 | 170-01 | 304-07 | 524-71) 870-38 | 1371-13 | 1993-01 | 2497-22 | 2247-23 | 740-79 
+ -80 | = 51-03 99-20 187-37 | 345-25 | 620-62 | 1081-22 | 1789-53 | 2668-18 | 3067-85 | 1466-04 
+ +85 23-37 47-59 | 95-03 | 187-37 | 366-47 | 711-33 | 1358-89 | 2473-52 | 3826-37 | 2979-24 
+ -90 7-50 16-01 33°82 71-51 | 152-84 | 333-27 | 746-59 | 1712-69 | 3823-56 | 5828-60 
+ 95 1-01 2-27 5:07 11-53 27-06 66:90 | 179-06 | 538-90 | 1916-67 | 7841-74 
+ 1:00 | -00 00 | -00 ‘00 00 00 00 -00 -00 00 
Mean | “0000 0946 -1896 +2850 3813 “4787 5776 6785 -7819 8887 
Mode | +0000 “1411 -2774 4050 -5219 6270 -7206 -8036 ‘8771 9422 
o 3333 3308 3232 3103 -2917 2671 *2355 1958 1461 0832 
(1 — p?)/ Vn—1| -3333 3300-3200 3033 2800 -2500 2133 -1700 -1200 0633 
By  -0000 "0242 | -0989 2317 4374 -7431 | 1-2002 | 1-:9122 | 3-1377 | 5-7475 
Be 2-4545 | 2-4933 | 2-6137 | 2-8292 | 3-1669 | 3-6774 | 4-4598 | 5-7290 | 8-0534 | 13-6667 


y variate (correlation in sample). 


H. E. Soper, A. W. Youne, B. M. Cavu, A. Len, K. Pkarson 387 


TABLE A—(continued). 


n= 11. p variate (correlation in population sampled). 
0 ail 2 3 4 7) 6 4 ‘8 Y 
- 1:00 -00 -00 -00 -00 -00 00 -00 -00 -00 -00 
— 95 “34 “14 05 -02 ‘O1 00 -00 = = — 
- 90 3-48 1-46 “59 22 -08 02 ‘01 00 = — 
-— 85 13-10 5:73 2-40 “94 | “34 slain) 03 ‘Ol — — 
- -80 32-59 14-90 6-47 2-63 | She) “31 08 -02 — a 
= ye) 64-48 30-82 13-91 5:84 | 2-22 ‘74 | 20 O04 00 — 
- :70 110-28 55-10 25°85 11-22 4-40 1-51 “42 09 ‘01 — 
- 65 170-38 89-00 43-43 19-52 7:90 2-78 80 V7 02 — 
- -60 244:13 | 133-37 67-72 31-55 13-19 4:77 | 1-41 30 04 — 
- +06 329-91 | 188-52 99-67 48-15 20-80 7-76 2:35 ‘51 06 = 
- +00 425-31 | 254-27 | 140-04 70-22 31:39 12-08 3°76 “84 10 00 
- 45 527-29 | 329-89 | 189-38 98:64 45-66 18-15 5:83 1:34 Aly ‘01 
— -40 632-37 | 414-10 | 247-94 | 134-25 64-43 26-48 8-77 2-08 | 27 ‘01 
- +35 736-81 | 505-13 | 315-62 | 177-84 88-59 37-70 12-90 3°16 43 ‘O1 
- +30 836-83 | 600-74 | 391-97 | 230-03 | 119-07 52-54 18-61 4:71 65 02 
— +25 928-73 | 698-31 | 476-06 | 291-27 | 156-85 71-87 26:38 6-91 Bh) 03 
- -20 1009-12 | 794-88 | 566-57 | 361-75 | 202-94 96-71 36-86 10-01 1-49 05 
= 1d 1074-98 | 887-29 | 661-66 | 441-31 | 258-26 | 128-20 50-84 14-35 221 -08 
- 10 1123-87 | 972-26 | 759-03 | 529-42 | 323-64 | 167-66 69-32 20-37 3:27 12 
- 05 1153-95 | 1046-54 | 855-92 | 625-00 | 399-73 | 216-51 93-53 28-70 | 4:80 “19 
-00 1164-10 | 1107-05 | 949-18 | 726-44 | 486-84 | 276-25) 124-99 40-16 | 7-04 -29 
+ 05 1153-95 | 1150-99 | 1035-30 | 831-45 | 584-86 | 348-45 | 165-55 55°88 | 10-29 “44 
+ -10 1123-87 | 1176-02 | 1110-55 | 937-05 | 693-05 | 434-44 | 217-39 77-34 15-02 68 
+ +15 1074-98 | 1180-37 | 1171-11 | 1039-52 | 809-89 | 535-54) 283-09 | 106-54) 21-93 1-06 
+ -20 ~1009-12 | 1163-01 | 1213-24 | 1134-42 | 932-85 | 652-40) 365-55 | 146-17 32-04 1-65 
+ +26 928-73 | 1123-74 | 1233-55 | 1216-69 | 1058-17 | 784-94 | 467-92 | 199-73) 46-90 2-60 
+ +30 836-83 | 1063-29 | 1229-19 | 1280-75 | 1180-68 | 931-80 | 593-39 | 271-79) 68-83 4-13 
+ +36 736-81 | 983-38 | 1198-19 | 1320-82 | 1293-66 | 1089-82 | 744-74 | 368-23 101-35 6-67 
+ -40 632-37 | 886-74 | 1139-72 | 1331-30 | 1388-77 | 1253-34 | 923-68 | 496-34 | 149-79 10-92 
+ 45 527-29 | 777-05 | 1054-45 | 1307-30 | 1456-28 | 1413-46 | 1129-63 | 664-75 | 222-30 18-21 
+ 50 425-31 | 658-86 | 944-78 | 1245-35 | 1485-58 | 1557-29 | 1358-02 | 882-89 | 331-21 31-02 
+ -65 329-91 | 537-38 | 815-03 | 1144-27 | 1466-08 | 1667-54 | 1597-69 | 1159-22 | 495-09 54-17 
+ -60 244-13 | 418-24 | 671-55 | 1006-07 | 1388-90 | 1722-77 | 1827-66 | 1497-43 | 741-14 97°35 
+ 65 170-38 | 307-09 | 522-51 | 836-93 | 1249-15 | 1699-11 | 2013-49 | 1888-67 | 1107-09 | 180-79 
+ -70 110-28 | 209-16 | 377-49 | 647-80 | 1049-15 | 1574-50 | 2105-04 | 2297-63 | 1638-62 | 348-37 
+ +75 64-48 | 128-73 246-69 | 454-52 | 801-88 | 1336-70 | 2039-33 | 2640-85 | 2370-71 | 698-67 
+ +80 32-59 68-50 | 139-53 | 276-64 | 533-90 | 995-96 | 1756-78 | 2762-99 | 3261-77 | 1455-70 
+ +85 13-10 29-01 62-88 | 134-46 | 285-17) 600-28 | 1242-87 | 2443-04 | 4017-99 | 3094-34 
+ -90 3°48 8-12 18-74 43-33 | 101-48 | 243-28 | 601-85 | 1532-03 | 3796-61 | 6195-13 
+ 95 “34 83 2:04 5-11 13-29 36-65 | 110-61 | 381-37 | 1590-43 | 7823-00 
+ 1:00 ‘00 00 -00 00 00 -00 00 00 00 -00 
Mean 0000 | -0952 | -1906 | -2865 | -3831 | -4808 | -5799 | -6807 | -7839 | .g900 
Mode 0000 1352 2667 3912 5066 “6121 “7074 -7930 8697 -9384 
6 3162 | -3138 | -3064 | -2938 | -2758 | -2519 | -2214 | -1833 | -1360 | .o767 
(l—p?)/Vn—1| -3162 313] 3036 2878 2656 2372 2024 1613 1138 “0601 
By 0000 0228 0932 2179 -4101 6933 | 1-1112 | 1-7516 | 2-8193 | 4-9565 
Bo 2-5000 | 2:5372 | 2-6527 | 2-8587 | 3-1798 | 3-6616 | 4:3914 | 5-5540 | 7-6188 | 12-2879 


388 Distribution of Correlation Coefficient in Small Samples 


TABLE A. Ordinates and Constants of Frequency Curves. 


y variate (correlation in sample). 


Teal p variate (correlation in population sampled). 
0 ‘1 2 3 4 7) 6 7 8 9 
— 1-00 -00 -00 -00 -00 -00 -00 -00 -00 -00 -00 
— 99 ‘ll “04 | ‘01 -00 -00 | 00 — — — — 
- -90 1-60 ‘61 22 08 02 01 -00 — — — 
- +85 7:30 2-93 1-12 “40 13 04 01 = = — 
— +80 20-67 | 8-71 3:47 1-28 43 12 03 00 — — 
- 75 45-08 19-94 | 8-29 | 3°18 1-10 “33 08 | ‘01 = = 
— 70 83-24 38-68 | 16-77 | 6-68 2-38 | ‘73 18 | 03 00 — 
— 65 136-86 66-80 30-25 12-52 4-62 1-46 37 07 ‘01 — 
— 60 206-44 | 105-87 50-10 21-57 8-24 2-69 “70 “13 ‘01 — 
— 300 291-24 | 156-97 77-68 34-81 13-80 | 4-65 1-25 | 23 02 —_ 
— -a0 389-33 | 220-58 | 114-20 53-33 21-95 | 7-66 | 2°12 | ‘41 04 —_ 
— 45 497-73 | 296-51 | 160-71 78-27 33°48 | 12-11 3-46 -69 07 = 
— -40 612-62 | 383-82 | 217-93 | 110-79 49-33 | 18-52 5-48 1-13 “12 —_ 
- 35 729-56 | 480-84 | 286-20 152-04 70:53 | 27-52 8:45 1-80 -20 00 
— +30 843-79 | 585-17 | 365-35 | 203-01 98-26 39-90 12-72 2-80 32 ‘01 
- +25 | 950-51 | 693-78 | 454-68 264-55 | 133-77 56-63 18-79 | 4:30 51 ‘01 
- -20 | 1045-09 | 803-05 | 552-83 337-18 | 178-37 78-86 27-27 6-49 -80 -02 
- 15 1123-41 | 908-99 | 657-79 | 421-03 | 233-38 | 107-94 39:00 9-69 1-24 03 
- 10 1181-98 | 1007-35 | 766-83 | 515-70 | 299-98 | 145-45 55-03 14-30 1:91 05 
— 05 1218-21 | 1093-82 | 876-57 | 620-13 | 379-19 | 193-12 76-70 20-91 2:93 08 
-00 1230-47 | 1164-30 | 983-02 | 732-48 | 471-64 | 252-88) 105-70 30-32 4-46 13 
+ 05 1218-21 | 1215-06 | 1081-67 | 850-05 | 577-41 | 326-69) 144-13 43-65 6-79 “21 
+ -10 1181-98 | 1243-05 | 1167-70 | 969-15 | 695-83 | 416-49 | 194-57 62-45 10-30 “34 
+ +15 1123-41 | 1246-03 | 1236-18 | 1085-08 | 825-16 | 523-93 | 260-05 88-83 15-62 56 
+ -20 1045-09 | 1222-86 | 1282-35 | 1192-20 | 962-35 | 650-07 | 344-11 | 125-68 23-70 ‘91 
+ +26 950-51 | 1173-63 | 1301-99 | 1284-05 | 1102-71 | 794-98) 450-63 | 176-90 35-99 1-49 
+ +30 843-79 | 1099-72 | 1291-80 | 1353-61 | 1239-72 | 957-09 | 583-60 | 247-67 54-78 2-49 
+ +35 729-56 | 1003-91 | 1249-80 | 1393-76 | 1364-85 | 1132-50 | 746-54 | 344-77 83-60 4-20 
+ -40 612-62 | 890-31 | 1175-76 | 1397-87 | 1467-64 | 1314-07 | 941-64 | 476-75 | 127-99 7-20 
+ -45 497-73 | 764-16 | 1071-56 | 1360-66 | 1536-09 | 1490-50 | 1168-15 | 653-92 | 196-63 12-58 
+ +30 389-33 631-64 | 941-41 | 1279-15 | 1557-56 | 1645-55 | 1420-17 | 887-55 | 303-03 22-49 
+ dd 291-24 499-45 | 791-98 | 1153-78 | 1520-32 | 1757-80 | 1683-36 | 1187-72 | 468-00 41-25 
+ -60 206-44 | 374-33 | 632-17 | 989-48 | 1415-90 | 1801-62 | 1930-97 | 1558-25 | 722-68 77:95 
+ 65 136-86 | 262-47 | 472-60 | 796-46 | 1242-33 | 1750-35 | 2120-06 | 1986-75 | 1110-84 | 152-41 
+ 70 83-24 | 168-90 | 324-59 | 590-32 | 1007-76 | 1582-81 | 2190-55 | 2427-07 | 1685-48 | 309-54 
+. 75 45-08 96-80 | 198-77 | 391-03 | 733-76 | 1294-33 | 2072-67 | 2774-00 | 2484-25 | 654-55 
+ +80 20-67 46-98 | 103-20 |. 220-15 456-19 | 911-23 | 1713-02 | 2842-00 | 3444-82 | 1435-84 
+ 85 7:30 | 17-56 41-32 95-84 | 220-41 | 503-13 | 1129-13 | 2396-80 | 4191-11 | 3192-59 
+ -90 1-60 | 4-09 10-31 26-08 66-93 | 176:39 | 481-91 | 1361-28 | 3744-80 | 6541-18 
+ 95 ‘11 | 30 81 2-25 6-48 19-94 67-87 | 268-10 | 1310-98 | 7752-85 
+ 1:00 00 | -00 -00 -00 | -00 | -00 -00 -00 -00 -00 
—— zie 
Mean 0000 | -0956 | -1914 | -2877 | -3847 | -4826 | -5818 | -6826 | -7855 | -8910 
Mode 0000 | -1309 | -2586 | -3805 | -4948 | -6004 | -6968 | -7843 | -8636 | -9353 
o 3015 | .2991 | -2919 | -2797 | -2622 | -2390 | -2096 | -1729 | -1276 | -0714 
(l- p*)/Vn—1] -3015 2985 2895 2744 2533 2261 -1930 1538 1085 0573 
By -0000 0215 0880 2054 3854 6487 | 1-0329 | 1-6118 | 2-5509 | 4-3322 
By 2-5385 | 2:5742 | 2-6848 | 2-8816 | 3-1870 | 3-6417 | 4:3231 | 5:3905 | 7-2330 | 11-1595 


tual 


y variate (correlation in sample). 
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TABLE A—(continued). 


n = 13. p variate (correlation in population sampled). 
0 al 2 3 4 O 6 7 ‘8 9g 
— 1:00 00 00 -00 -00 -00 00 00 -00 “00 -00 
- 95 “04 ‘01 00 -00 -00 -— — —- — = 
- -90 ‘73 26 09 03 ‘O1 00 oan = = — 
- +85 4-04 1-49 52 SIL7/ 05 01 | ‘00 == = = 
- +80 13-03 5:06 1-85 62 19 05 | ‘O1 — == = 
- 75 31-34 12-84 4:91 1-72 “54 “14 | 03 00 = = 
- 70 62-49 27-00 10-82 3°95 1-28 35 08 ‘Ol _ = 
- 65 109-33 49-86 20-96 7:99 2-69 “76 | 17 03 = = 
— ‘60 173-60 83-57 36:87 14-67 5°13 1-51) 35 05 00 — 
- 05 255-68 | 129-97 60-20 25-03 9-10 2-78 | 66 ‘ll ‘01 — 
— 00 354-43 | 190-29 92-61 40-28 15-26 4-83 | Tt9 20 “02 — 
- 45 467-24 | 265-03 | 135-62 61-76 24-42 8-04 2-05 35 03 — 
- -40 590-21 | 353-79 | 190-50 90-93 37-56 12-88 3-41 ‘61 05 — 
- 35 718:39 | 455-19 | 258-08 | 129-26 55°84 | 19-97 5°50 1-01 09 — 
- +30 846-13 | 566-86 | 338-66 | 178-18 80-64 30-14 8-65 1-66 15 — 
— +26 967-43 | 685-47 | 431:85 | 238-96 | 113-45 44-38 13-30 2-66 -26 00 
- :20 1076-39 | 806-84 | 536-45) 312-55 | 155-92 63-95 20-07 4:19 “42 ‘01 
- 6 1167-55 | 926-10 | 650-33 | 399-46 | 209-73 90-38 29-75 6-51 69 ‘Ol 
“10 1236-25 | 1037-95 | 770-44 | 499-56 | 276-51) 125-48 43-44 9-98 UeItiL -02 
- 065 1278-96 | 1136-93 | 892-78 | 611-89 | 357-72 | 171-31 62-54 15-15 1:77 04 
-00 1293-45 | 1217-76 | 1012-46 | 734-51 | 454-39 | 230-21 88-89 22-76 2-82 06 
+ 05 1278-96 | 1275-64 | 1123-90 | 864-29 | 566-93 | 304-67 | 124-80 33°91 4-45 10 
+ 10 1236-25 | 1306-66 | 1221-04 | 996-83 | 694-79 | 397-09 | 173-18 50-15 7:03 oH EP 
+ +6 1167-55 | 1308-10 | 1297-68 | 1126-41 | 836-11 | 509-76 | 237-58 73-66 11-07 -29 
+ -20 1076-39 | 1278-72 | 1347-93 | 1246-04 | 987-33 | 644-20 | 322-16 | 107-48 17-43 “50 
+ -25 967-43 | 1218-98 | 1366-66 | 1347-70 | 1142-82 | 800-73 | 431-61 | 155-83 27-47 “86 
+ +30 846-13 | 1131-13 | 1350-13 | 1422-76 | 1294-58 | 977-68 | 570-83 | 224-46 43-35 1-49 
+ +35 718-39 | 1019-23 | 1296-48 | 1462-65 | 1432-08 | 1170-41 | 744-26 | 321-04 68-58 2-63 
+ -40 590-21 | 888-97 | 1206-29 | 1459-72 | 1542-51 | 1370-21 | 954-71) 455-44 | 108-77 4-72 
+ 45 467-24 | 747-35 | 1082-96 | 1408-44 | 1611-41 | 1563-16 | 1201-40 | 639-76 | 172-98 8-65 
+ -d0 354-43 | 602-21 | 932-90 | 1306-67 | 1624-11 | 1729-32 | 1477-09 | 887-39 | 275-75 16-22 
+ +38 255-68 | 461-64 | 765-35 | 1157-00 | 1567-94 | 1842-84 | 1763-97 | 1210-31 | 440-01 31-24 
+ -60 173-60 | 333-18 | 591-84 | 967-83 | 1435-55 | 1873-82 | 2029-03 | 1612-76 | 700-88 62-08 
+ 65 109-33 | 223-11 | 425-11 | 753-80 | 1228-80 | 1793-32 | 2220-16 | 2078-63 | 1108-60 | 127-80 
+ -70 62-49 | 135-64 | 277-57 | 534-99 | 962-72 | 1582-52 | 2267-17 | 2549-96 | 1724-34 | 273-57 
+ 75 31:34 72-39 | 159-28 | 334-58 | 667-77 | 1246-49 | 2095-16 | 2898-15 | 2589-24 | 609-94 
+ +80 13-03 32-04 75-91 | 174:24 387-67 | 829-19 | 1661-34 | 2907-54 | 3618-62 | 1408-68 
+ 85 4:04 10-58 27-01 67:94 | 169-42 | 419-43 | 1020-26 | 2338-80 | 4348-30 | 3276-40 
+ -90 ‘73 2-05 5°65 15-61 43-90 | 127-20 | 383-80 | 1203-06 | 3673-97 | 6869-84 
+ 95 04 ‘ll “32 98 314 10-79 41-42 | 187-46 | 1074-87 | 7642-57 
+ 1:00 00 00 -00 00 -00 -00 00 -00 ‘00 -00 
Mean 0000 -0960 1921 2887 3859 4841 5834 6841 -7868 8919 
Mode -0000 1274 2522 3721 4853 5908 6880 “7772 8585 9326 
o 2887 2863 2793 -2674 | +2504 2279 1994 1640 1206 0671 
(1 — p?)/Vn—1| +2887 2858 2771 -2627 | -2425 -2165 1848 1472 -1039 0548 
B, -0000 0204 0833 1941 3631 6087 9636 | 1-4902 | 2:3236 | 3-8337 
Bo 2-5714 | 2-6057 | 2-7117 | 2-8999 | 3:1904 | 3-6201 | 4:2575 | 5-2404 | 6-8937 | 10-2454 


390 Distribution of Correlation Coefficient in Small Samples 


TABLE A. Ordinates and Constants of "Frequency Curves. 


r variate (correlation in sample). 


n= 14. p variate (correlation in population sampled). 
0 of 2 3 “4 7) 6 7 8 9 
— 1:00 -00 00 00 00 00 00 00 00 00 00 
— 95 ‘Ol 00 -00 00 — — —— — = = 
- 90 34 ‘ll -03 ‘Ol 00 — — = = — 
— 85 2-23 “715 24 07 -02 00 -- = — = 
- 8&0 8-18 2-93 98 30 08 02 00 — = = 
- 7d 21-69 8-22 2-90 93 26 -06 ‘Ol — =A = 
- 70 46-70 18-77 6-95 2-33 69 “17 03 00 = a 
- 65 86-94 37-04 14-45 5-07 1-55 -40 -08 ‘01 ad = 
- -60 145-33 65-68 27-00 9-93 | 3-17 84 17 “02 a= — 
- 3d 223-45 | 107-13 46-44 17-91 5-98 1-65 *35 05 00 — 
— +30 321-20 | 163-42 74:76 30-28 | 10-57 3-04 66 09 -O1 — 
— 45 436-63 | 235-83 | 113-94 48-51 17-73 5:31 1-21 18 ‘Ol = 
— -40 566-06 | 324-64 | 165-76 74:28 28-46 8-92 2-11 +32 02 — 
— 365 704:20 | 428-97 | 231-67] 109-39 44-02 14:43 3°57 ‘57 “04 = 
- 30 844-64 | 546-64 | 312-50 | 155-68 65-88 22-66 5:86 98 07 — 
— +25 980-21 | 674:21 | 408-32 | 214-86 95-78 34-62 9-38 1-64 13 — 
— +20 1103-62 | 806-97 | 518-19 | 288-41 | 135-67 51-63 14:70 2-69 -23 00 
- +16 1207-94 | 939-26 | 640-05 | 377-28 | 187-62 75°34 22-59 4-35 38 ‘01 
- +10 1287-18 | 1064-65 | 770-57 | 481-74 | 253-73 | 107-76 34-14 6-94 64 ‘Ol 
— 05 1336-68 | 1176-41 | 905-17 | 601-04 | 335-93 | 151-28 50-77 10-92 1-07 02 
00 1353-52 | 1267-92 | 1038-07 | 733-22 | 435-79 | 208-62 74-41 17:01 1-77 03 
+ 05 1336-68 | 1333-18 | 1162-50 | 874-80 | 554-12 | 282-80 | 107-57 26-23 2-91 05 
+ +10 1287-18 | 1367-32 | 1271-05 | 1020-68 | 690-62 | 376-89 | 153-45 40-10 4-77 09 
+ Ld 1207-94 | 1367-06 | 1356-11 | 1164-04 | 843-38 | 493-74 | 216-07 60-80 7-81 “15 
+ 20 1103-62 | 1331-09 | 1410-48 | 1296-43 | 1008-40 | 635-51 | 300-25 91-50 12-76 27 
+ +25 980-21 | 1260-37 | 1428-09 | 1408-13 | 1179-07 | 802-90 | 411-53 | 136-65 20-87 “49 
+ +30 844-64 | 1158-20 | 1404-74 | 1488-72 | 1345-78 | 994-23 | 555-84 | 202-51 34-16 -89 
+ dd 704-20 | 1030-12 | 1338-84 | 1528-05 | 1495-87 | 1204-16 | 738-66 | 297-60 56-01 1-64 
+ -40 566-06 | 883-63 | 1232-03 | 1517-46 | 1613-92 | 1422-35 | 963-63 | 433-15 92-02 3:08 
+ +45 436-63 | 727-61 | 1089-56 | 1451-35 | 1682-84 | 1632-02 | 1230-08 | 623-12 | 151-50 5-92 
+ +50 321-20 | 571-57 | 920-31 | 1328-79 | 1685-90 | 1809-23 | 1529-42 | 883-27) 249-81 11-64 
+ dd 223-45 | 424-77 | 736-29 | 1155-02 | 1609-82 | 1923-36 | 1840-21 | 1227-85 | 411-85 23-56 
+ 60 145-33 | 295-23 | 551-59 | 942-42 | 1448-96 | 1940-22 | 2122-58 | 1661-76 | 676-71 49-22 
+ 65 86-94 | 188-79 | 380-68 | 710-22 | 1209-98 | 1829-15 | 2314-64 | 2165-10 | 1101-47 | 106-69 
+ +70 46:70 | 108-45 | 236-30] 482-68 | 915-58 | 1575-19 | 2336-05 | 2667-21 | 1756-31 | 240-71 
+ 75 21-69 53:89 | 127-07 | 284-99 | 605-00 | 1195-09 | 2108-50 3014-47 | 2686-77 | 565-87 
+ +80 8-18 21-76 55:58 | 137-29 | 327-97 | 751-18 | 1604-06 | 2961-44 | 3784-47 | 1375-97 
+ +85 2-23 | 6-34 17-57 47:94 | 129-65 | 348-10 | 917-81 | 2272-14 4491-55 | 3347-70 
+ -90 34 1-02 3-08 9-30 28-67 91-32 | 304-31 | 1058-56 | 3588-66 | 7183-43 
+ +95 ‘O01 04 13 43 1-52 5-81 25-17 | 130-49 | 877-42 | 7501-01 
+ 1:00 00 00 00 “00 00 00 00 00 00 00 
Mean 0000 | -0963 | -1927 | -2896 | -3870 | -4853 | -5847 | -6854 | -7879 | -8926 
Mode 0000 | -1246 | -2470 | -3652 | -4775 | -5828 | -6808 | -7711 | -8541 | -9303 
o 2774 | -2751 | -2682 | -2566 | -2401 | -2182 | -1906 | -1564 | -1145 | -0634 
(1 - p2y/Vn - 1 “2774 2746 2663 2524 2330 -2080 “1775 “1414 -0998 -0527 
By -0000 “0194 -0790 1838 +3430 -5729 -9020 | 1-3838 | 2-1298 | 3:4290 
Bo 2-6000 | 2:6329 | 2-7346 | 2-9145 | 3-1912 | 3-5979 | 4:1955 | 5-1038 | 6-5961 | 9-4886 


r variate (correlation in sample). 
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TABLE A—(continued). 


n= 15 p variate (correlation in population sampled). 
0 al 2 3) “4 7) 6 “tf ‘S 9 
“ i ‘= rhe = : 
— 1:00 00 -00 -00 ‘00 00 00 00 005 00 -00 
= 95 00 005 -00 = — _— = 
— -90 15 04 O01 00 00 — = = = = 
— 85 1-22 “38 ‘ll 03 | ‘O1 00 = = = = 
- -80 5-12 1-69 “52 14 | 04 ‘O1 — = = a 
— 75 14-96 5-25 1-70 | “50 13 03 00 =— | = = 
=) lh 34:77 12-99 4-45 | 1-37 37 -08 ‘O1 — *), = — 
— 65 68-88 27-42 9-93 3-21 | ‘90 “211 O04 00 = == 
- -60 121-21 51-42 19-70 6-69 | 1-96 “47 ‘08 ‘O1 = = 
-— 05 194-55 87-97 35:70 | 12-77 3°91 ‘97 18 “02 — — 
- 60 289-98 | 139-82 60-13 22-68 7-29 1-90 37 05 00 — 
- 46 406-50 | 209-05 95-35 37-96 12-82 3-50 2a ‘09 ‘01 — 
- -40 540-85 | 296-76 | 143-69 60-46 21-49 6-15 1-30 SIIY/ Ol; — 
— 35 687-70 | 402-73 | 207-17 | 92-23 34:56 10:39 2:30 +32 02 — 
- 30 839-97 | 525-16 | 287-27 | 135-50 53-62 16-97 3°95 57 04 = 
- +26 989-42 | 660-62 | 384-61 | 192-47 80-56 26-90 6-59 1-00 | 07 = 
- -20 1127-28 | 804-07 | 498-68 | 265-12) 117-61 41-52 10-72 1-72 | 12 — 
- L5 1245-03 | 949-02 | 627-56 | 354-99 | 167-21 62-56 17-09 2-90 21 — 
- +10 1335-15 | 1087-93 | 767-80 | 462-81 | 231-94 92-19 26-73 4-80 37 00 
- 05 1391-74 | 1212-67 | 914-28 | 588-16 | 314-29 | 133-08 41-06 7:85 “64 ‘01 
00 1411-04 | 1315-18 | 1060-32 | 729-17 | 416-38 | 188-35 | 62-06 12-66 1-11 ‘O1 
+ 05 1391-74 1388-08 | 1197-90 | 882-11) 539-56 | 261-56 92-37 20-21 1-89 02 
+ +10 1335-15 | 1425-42 | 1318-13 | 1041-17 | 683-89 | 356-36 | 135-46 31-94 3°23 04 
+ -15 1245-03 | 1423-30 | 1411-83 | 1198-41 | 847-52 | 476-43 | 195-77 50-00 5-49 08 
+ :20 1127-28 | 1380-40 | 1470-37 | 1343-80 | 1026-06 | 624-58 278-78 77:60 9-31 15 
+ +26 989-42 | 1298-27 | 1486-66 | 1465-74 | 1211-89 | 802-05 | 390-92 | 119-38 15-80 28 
+ +30 839-97 | 1181-45 1456-06 | 1551-88 | 1393-76 | 1007-27 | 539-22 | 182-08 26-81 53 
+ +35 687-70 | 1037-21 | 1377-39 | 1590-37 | 1556-64 | 1234-25 | 730-36 | 274-85 45-57 1-02 
+ 40 540-85 | 875-02 | 1253-60 | 1571-56 | 1682-30 | 1470-94 | 968-99 | 410-40 77:56 2-01 
+ +45 406-50 | 705-73 | 1092-09 | 1489-95 | 1750-86 | 1697-55 | 1254-73 | 604-65 | 132-19 4-04 
+ -d0 289-98 | 540-45 | 904-49 | 1346-21 | 1743-50 | 1885-77 | 1577-71 | 875-91 | 225-46 8:33 
+ 65 194-55 | 389-38 | 705-68 1148-73 | 1646-64 1999-91 1912-58 1241-01 | 384-06 17-70 
+ -60 121-21 | 260-61 | 512-15 | 914-23 | 1457-04 | 2001-47 | 2212-18 | 1705-89 | 650-97 38°88 
+ 65 68-88 | 159-16 | 339-61 | 666-66 1187-01 | 1858-75 | 2404-17 2246-82 | 1090-34 88-74 
+ -70 34:77 86-37 | 200-40 | 433-86 | 867-51 | 1562-05 | 2398-10 | 2779-52 | 1782-27 | 211-01 
+ 75 14-96 39-97 | 100-99 | 241-85 | 546-09 | 1141-54 | 2114-05 | 3123-86 | 2777-70 | 523-05 
+ -80 5:12 14-72 40-55 | 107-77 | 276-43 | 677-98 | 1543-04 | 3005-21 | 3943-34 | 1339-09 
+ +85 1-22 3°79 11:39 | 33-71 98-85 | 287-83 | 822-59 | 2199-24 | 4622-48 | 3408-02 
+ -90 15 51 1-67 5:52 | 18:65 | 65-32 | 240-39 | 927-98 | 3492-48 | 7483-96 
95 00 ‘01 05 ole “73 | 3-12 15-23 | 90-51 | 713-62 | 7335-23 
+ 1:00 00 -00 | ‘00 00 00 | 00 00 ‘00 | 00 00 
Mean 0000-0965 | -1932 | -2003 | -3879 | -4864 | -5858 | -6865 | -7888 | -8932 
Mode 0000 | +1224 | -2429 | -3595 | -4710 | -5761 | -6745 | -7659 | -8504 | -9283 
o -2673 2650 | *2583 -2470 | -2309 2096 +1828 1496 1093 “0602 
(1 -- p?2)/Vn -1] -2673 2646 +2566 *2423 | +2245 2004 -1710 1363 “0962 0508 
Fi 0000 ‘0184 | -0751 1745 3248 5407 8473 | 1-2904 | 1:9635 | 3-0956 
b 2:6250 | 2:6566 | 2-7542 | 2-9265 | 3-1904 | 3-5759 | 4-1375 | 4-9799 | 6-3347 | 8-8548 


r variate (correlation in sample). 
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TABLE A. Ordinates and Constants of Frequency Curves. 
i— 1G. p variate (correlation in population sampled). 
0 “1 2 3 “4 7) 6 7 S$ 9 
— 1:00 00 00 -00 00 -00 -00 00 -00 00 -00 
- 95 -00 — — = = = — — = ae 
- -90 “O07 02 00 -00 me oe = = — te 
- 85 67 19 05 ‘O1 00 = = 
— SO 3-19 07 27 07 02 00 = = an Say 
- 75 10-28 3°34 1-00 27 -06 ‘Ol 00 — ia == 
- 70 25:80 8:97 2-84 “80 “19 “04 01 — = = 
- 65 54:39 20-23 6-80 2-02 51 “1 02 — = = 
- 60 100-76 40-12 14-33 4-50 1-20 26 “04 00 = = 
- 65 168-85 | 72-01] 27:35] 9-08 2:55 ‘BT 09 Ol) — — 
— -50 260-97 | 119-24] 48-20] 16-93 5:01 1-18 -20 02|  — _ 
— 45 377-23 | 184-72 | 79-55 | 29-61 9-24 2:29 41 05) — — 
— -40 515-11 | 270-42 | 124-16 49-05 16-17 4-23 80 09 00 — 
- 35 669-43 | 376-88 | 184-68] 77-51] 27-05 | 7-46 1-48 18 01; — 
- +30 832-67 | 502-91 | 263-24 | 117-56 43-50 12-67 2-65 “34 -02 — 
- 25 995-53 | 645-25 | 361-12] 171-85| 67:54 | 20:84] 4-61 61 03} — 
— +20 1147-76 | 798-62 | 478-36 | 242-94 | 101-63 33-28 7-80 1-10 -06 — 
- +16 1279-16 | 955-82 | 613-35 | 332-95 | 148-55 51-78 12-89 1-93 12 ~- 
- -10 1380-50 | 1108-16 | 762-59 | 443-20 | 211-35 78-63 20-86 3-32 “21 — 
= 05 1444-45 | 1246-06 | 920-58 | 573-72 | 293-10] 116-70] 33:10] 5-62 38 | — 
:00 1466-31 | 1359-84 | 1079-59 | 722-83 | 396-57 | 169-51 51-59 9-40 69 00 
+ 05 1444-45 | 1440-62 | 1230-45 | 886-63 | 523-71 | 241-10] 79-06 | 15-52 1-23 01 
+ +10 1380-50 | 1481-25 | 1362-60 | 1058-69 | 675-07 | 335-89 | 119-19| 25:36) 2-18 02 
+ +15 1279-16 | 1477-14 | 1465-15 | 1229-86 | 848-97 | 458-26 | 176-81 40-99 3°84 04 
+ 20 1147-76 | 1426-97 | 1527-93 | 1388-46 | 1040-70 | 611-89 | 258-02 | 65-60] 6-77 08 
+ +25 995-53 | 1333-04 | 1542-70 | 1520-86 | 1241-67 | 798-66 | 370-16 | 103-97 11-92 -16 
+ -30 832-67 | 1201-32 | 1504-45 | 1612-58 | 1438-86 | 1017-24 | 521-43 | 163-10) 20-98 31 
+ +35 669-43 | 1041-01 | 1412-54 | 1649-98 | 1614-73 | 1261-08 | 719-87 | 253-03 36-96 63 
+ -40 515-11 | 863-74 | 1271-49 | 1622-42 | 1748-02 | 1516-38 | 971-31 | 387-62 65-16 1-30 
+ 45 377-23 | 682-33 | 1091-14 | 1524-73 | 1815-85 | 1760-11 | 1275-84 | 584-87) 114-98 | 2-74 
+ +30 260-97 | 509-39 | 886-11 | 1359-54 | 1797-36 | 1959-32 | 1622-38 | 865-86 | 202-85 5:94 
+ +65 168-85 | 355-79 | 674-19 | 1138-85 | 1678-96 | 2072-94 | 1981-54 | 1250-36 | 357-03 13-25 
+ 60 100-76 | 229-32 | 474-01 | 884-08 | 1460-52 | 2058-14 | 2298-30 | 1745-69 | 624-23 | 30-62 
+ +65 54:39 | 133-74 | 302-02 | 623-79 | 1160-78 | 1882-87 | 2489-31 | 2324-29 | 1075-93 73-57 
+ -70 25-80 68-58 | 169-43 | 388-74 | 819-36 | 1544-13 | 2454-04 | 2887-46 | 1802-95 | 184-41 
+. 75 10-28 29-55 80-00 | 204-59 | 491-36 | 1086-96 | 2112-96 | 3227-08 | 2862-73 | 481-97 
+ -80 3-19 9-92 29-49 84:33 | 232-25 | 609-99 | 1479-67 | 3040-08 | 4096-07 | 1299-15 
+ +85 67 2:25 7:36 23-62 75-13 | 237-24 | 734-93 | 2121-99 | 4742-40 | 3458-65 
+ -90 07 25 -90 3:27 12:09 |. 46:58 | 189-31 | 810-97 | 3388-31 | 7772-88 
+ 95 00 00 “02 “08 35 1-67 9-19 62-58 | 578-60 | 7150-91 
+ 1-00 -00 -00 -00 -00 -00 :00 -00 -00 -00 -00 
Mean 0000 0968 -1937 2909 +3888 4873 5868 *6875 -7896 8937 
Mode -0000 -1206 2394 3548 4655 5705 6692 ‘7614 8471 9265 
o 2582 2560 +2495 2384 +2227 2020 -1759 -1437 -1047 0575 
(1 - p2)/Vn -1| -2582 2556 | +2479 2350 -2169 -1936 1652 -1310 0930 0491 
By -0000 0176-0716 -1660 +3083 5117 ‘7983 | 1-:2080 | 1-8195 | 2-8181 
Bo 2-6471 | 26775 | 2-7712 | 2-9363 | 3-1883 | 3-5545 | 4:0836 | 4-8677 | 6-1046 | 8-3239 


ry variate (correlation in sample). 
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TABLE A—(continued). 


lake p variate (correlation in population sampled). 
| 
0 “1 2 3 | 6 7 8 9 
— 1:00 00 -00 -00 -00 00 -00 00 -00 -00 -00 
- 95 00 -00 = = See A Soe a ~s = = 
- 90 03 ‘01 -00 ae a ae —_ one ay VS eee 
eS) “37 10 02 -O1 00> — = 
= bl) 1:98 “56 14 -03 ‘Ol -00 ae = = — 
Sie 7-05 2-12 58 “14 03 -O1 — — = = 
= 0 19-10 6-17 1-81 ‘47 -10 | 02 00) — — — 
- 65 42-84 14-88 4-64 1:27 29 05 AOL = — —- 
=) 60 83:54 31-23 10:39} 3-02 | -74. | “14 0 — — — 
= ai) 146-13 | 58:78 | 20-89 6-43 1:66 34 05 00 _ -— 
= aNd) 234-22 | 101-41 38-54 12-61 3-43 “74. Ls -O1 — -- 
= off 349-12 162-78; 66:18) 23-04 6-64 1-50 “24 02 — — 
Sl) 489-26 | 245-73 | 106-99 39-68 12:14 2-90 “49 | 05 — — 
= 3Bi5) 649-87 | 351-73 | 164-17 64:96 | 21-11 5:33 95 | -10 00 — 
- 30 823-17 | 480-28 | 240-55 | 101-72] 35-19| 9-43 1:78 20 Ol = 
- +25 998-93 | 628-51 | 338-14| 153-03 | 56-47 16-10 3-22 37 02 = 
- :20 1165-43 | 791-03 | 457-61 | 222-01 87-57 26-61 5-66 | -70 03 = 
- 15 1310-64 | 960-04 | 597-82 311-43 131-60] 42-74 9-69 | 1:28 -06 -- 
- 10 1423-48 | 1125-69 | 755-35 | 423-25 | 192-06 66-87 16-23 2-28 “12 = 
- 05 1495-05 | 1276-87 | 924-29 | 558-10) 272-60 | 102-06 26-61 4-01 23 — 
00 1519-58 | 1402-18 | 1096-21 | 714:59 | 376-67) 152-13 42-77 6-95 -43 00 
05 1495-05 | 1491-07 | 1260-42 | 888-75 | 506-94 | 221-67 67-49 | 11-89 79 ‘O01 
ark 1423-48 | 1535-05 | 1404-71 | 1073-56 | 664-54 | 315-72 | 104-60 20-08 1-46 -O1 
+ 15 1310-64 | 1528-81 | 1516-33 | 1258-69 | 848-09 | 439-58 | 159-26 33-51 2-68 -02 
+ +20 1165-43 | 1471-07 | 1583-40 | 1430-69 | 1052-66 | 597-82 | 238-16 | 55-31 4-91 O04 
+ +25 998-93 | 1365-00 | 1596-49 | 1573-74 | 1268-71 | 793-12 | 349-55 90-30 8-97 09 
+ -30 823-17 | 1218-19 | 1550-21 | 1671-08 | 1481-38 | 1024-51 | 502-86 | 145-74 16-37 18 
+ +35 649-87 | 1041-98 | 1444-64 | 1707-15 | 1670-43 | 1284-98 | 707-60 | 232-3] 29-89 “39 
ap LY 489-26 | 850-26 | 1286-11 | 1670-36 | 1811-36 | 1558-97 | 970-98 | 365-12) 54-60 “84. 
+ 45 349-12 | 657-90 | 1087-22 | 1556-06 | 1878-12 | 1820-03 | 1293-78 | 564-21 | 99-74 1:86 
+ -d0 234-22 | 478-81 | 865-73 | 1369-26 | 1847-84 2030-22 | 1663-80 | 853-62 | 182-02 4-22 
+ +00 146-13 | 324-22 | 642-35 | 1125-98 | 1707-27 | 2142-80 | 2047-42 | 1256-38 | 331-00 9-90 
+ :60 83-54 | 201-23 | 437-52 | 852-59 | 1460-03 2110-68 | 2381-31 | 1781-59 | 596-99 24-05 
+ 65 42-84 | 112-08 | 267-85 | 582-09 | 1132-06 | 1902-13 | 2570-49 | 2397-95 | 1058-86 60-84 
+ -70 19-10 54-30 | 142-84 | 347-36! 771-78 | 1522-30 | 2504-51 | 2991-52 | 1818-97 | 160-72 
+ 7d 7-05 | 21:79) 63-21 | 172:59| 440-91 | 1032-18 | 2106-17 | 3324-73 | 2942-45 | 442-92 
+ 80 1-98 6-67 | 21:38 | 65-80| 194-61 | 547-33 | 1415-08 | 3067-10 | 4243-31 | 1257-03 | 
+ +85 37 1:34 4-74 16-51 56:94 | 195-02 | 654-85 2042-00 | 4852-41 | 3500-67 
+ -90 03 12 -49 1:93 7-82 | 33-12 | 148-67 | 706-81 | 3278-45 | 8051-44 
ep? 00 00 ‘Ol OSs". 17 ‘89 | 5:53 | 43-15 | 467-87 | 6952-68 
+ 1:00 00 -00 00 00 -00 00 00 -00 00 -00 
Mean 0000 | -0970 | +1941 2915 | +3895 | +4881 -5876 6883 7903 8941 
Mode 0000 | -1190 | -2364 | -3507 | -4608 | -5656 | 6646 | +7575 | “8442 | +9250 
Cased 2500 | -2479 2415 -2307 2153 1951 -1696 1384 1006 0551 
(1- p)/Vn-1| -2500 | -2475 2400 | +2275 2100 | +1875 -1600 +1275 -0900 0475 
By 0000 | -0168 0683 “1582 +2934 “4855 -7543 | 1-1348 | 1-6940 | 2-5832 
8, 2-6667 | 2-6960 | 2-7861 | 2-9446 | 3-1855 | 35340 | 4:0837 | 4:7661 | 5-9012 | 7:8748 


Biometrika xt 


r variate (correlation in sample). 
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TABLE A. Ordinates and Constants of Frequency Curves. 


n= 18. p variate (correlation in population sampled). 
0 a 2 Br) ving 5 6 7 8 9 
| 
— 1:00 -00 00 -00 -00 00 00 | -00 -00 -00 ‘00 
- 95 -00 a os ee es Sa ines = = eh 
=| +90 O1 00 Olea shake ae bes ae aa Of 
- +85 -20 05 ‘O1 00; — at Ik a a Be 
- -80 1-23 “32 08 02 00 =| = = aa 
- 75 4-82 1:34 34 08 ‘01 Oe = = cael a 
- -70 14-10 4:24 1-15 | 27 | 05 0) a i ae 
- +65 33-66 10-93 3:16 | “80 | SALT 03 = = a a 
~ 60 69:10 | 24-24 7-52 | 2-02 | “45 08 | -O1 = — — 
- +65 126-18 47:87 | 15-93 4-55 1:08 | -20 | AUB =n aa 
- -d0 209-71 86-05 30:73 9-36 2:35 “46 | 06) == oa aa" 
— 45 322-33 | 143-10 | 54-93 17-88 | 4-76 | “98 | “14 | ‘01 == a 
- -40 463-60 | 222-77 91-98 32-03 9-09 1-98 | “30 03 = = 
— +36 629-39 | 327-48 | 145-60 54-32 16-44 3-80 | “61 -06 = = 
- 30 | 811-85] 457-58 219-30| 87-80} 28-40 7-01 ieity) ‘ll = = 
- +25 999-97 | 610-75 | 315-87 | 135-94 47-10 12-40 | 2-24 23 Ol = 
— -20 1180-56 | 781-66 | 436-73 | 202-39 75°28 21-22 4:09 “44 02 = 
- +15 1339-70 | 961-98 581-30 | 290-60 116-32 35-20 | 7:27 84 ‘03 = 
- -10 1464-32 | 1140-77 | 746-40) 403-25 | 174-11 56-74 12-60 1-57 07 = 
- 06 1543-76 | 1305-33 | 925-87 | 541-62 | 252-92 89-04 21:34 2:86 “14 = 
-00 1571-04 | 1442-41 | 1110-44 | 704:76 | 356-91 | 136-21 35:38 5-13 27 = 
+ -05 1543-76 | 1539-62 | 1288-06 | 888-76 | 489-54 | 203-29 57-47 9-09 51 00 
+ +10 1464-32 | 1587-04 | 1444-69 | 1086-05 | 652-63 | 296-06 91-57 15-86 98 01 
+ -15 1339-70 | 1578-54 | 1565-59 | 1285-14 | 845-22 | 420-66) 143-10 27:33 1:87 ‘01 
:20 1180-56 | 1512-94 | 1637-00 | 1470-71 | 1062-25 | 582-69 | 219-31 46-52 3°55 02 
+. +28 999-97 | 1394-42 | 1648-23 | 1624-60 | 1293-27 | 785-76 | 329-31 78-24 6-73 05 
+ +30 811-85 | 1232-37 | 1593-57 | 1727-60 | 1521-54 | 1029-39 | 483-81 | 129-92 12-75 ‘ll 
+ +35 629-39 | 1040-48 | 1473-96 | 1762-12 | 1723-97 | 1306-25 | 693-90 | 212-79 24-12 24 
+ -40 463-60 | 835-02 | 1297-82 | 1715-64 | 1872-57 | 1598-97 | 968-37 | 343-11 45-64 b4 
+ -46 322-33 | 632-84 | 1080-74 | 1584-29 | 1937-95 | 1877-54 | 1308-88 | 543-00 86-32 1-26 
+ -d0 209-71 | 449-00 | 843-82 | 1375-79 | 1895-25 | 2098-72 | 1702-25 | 839-57 | 162-94 3-00 
+ +65 126-18 | 294-75 | 610-56 1110-63 | 1731-96 | 2209-80 | 2110-51 | 1259-46 | 306-15 7:37 
+ -60 | 69:10 | 176-17 402-90 | 820-29 | 1456-10 | 2159-46 | 2461-52 | 1813-97 | 569-59 18-84 
+ 65 33-66 93-71 | 236-98 541-90 1101-45 | 1917-07 | 2648-10 | 2468-15 | 1039-63 50:19 
+ -70 14-10 42:89 | 120-15 | 309-66 | 725-26 | 1497-24 | 2550-03 | 3092-09 | 1830-85 | 139-75 
+ -75 4-82 16-03 | 49-82 | 145-26 | 394-72 | 977-87 | 2094-49 | 3417-34 | 3017-33 | 406-10 
SoU 1-23 448 15:47) 51:23) 162-68 | 489-96 | 1350-14 | 3087-13 | 4385-61 | 1213-45 
+ +85 20 “79 | 3°05 11-51 43:06 | 159-94 | 582-14 | 1960-44 | 4953-44 | 3534-99 
+ -90 ‘O1 05 | 26 1-14 5-05 23:50 | 116-49 | 614-60 | 3164-79 | 8320-69 
+ 95 -00 -00 -00 ‘01 -08 47 3°32 29-69 | 377-45 | 6744-32 
+ 1:00 00 00 00 00 -00 -00 -00 -00 -00 -00 
Mean 0000 | -0971 | -1944 | -2920 | -3901 | -4888 | -5884 | -6890 | -7909 | -8945 
Mode 0000 -1176 2338 3472 4567 5613 6605 *7540 8417 9236 
o 2425 2405 2342 2237 2086 -1889 1641 1337 0970 0530 
(pV 2425 -2401 2328 2207 2037 “1819 1552 1237 0873 0461 
1 0000 0161 0653 ‘1511 -2797 4617 ‘7147 | 1-:0695 | 1-5839 | 2-3830 
2 2-6842 | 2-7124 | 2-7992 | 2-9515 | 3-1823 | 3-5144 | 3-9873 | 4-6737 | 5-7207 | 7-4908 


r variate (correlation in sample). 
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TABLE A—(continued). 


n= 19. p variate (correlation in population sampled). 
0 1 2 3 4 5 6 a 88 9 
_ | 
— 1:00 -00 00 -00 -00 -00 ‘OO -00 -00 -00 -00 
= 95 -00 ee. fies — ate a a —s a — 
— -90 -O1 -00 — = — = = — — — 
—- 85 ‘ll “02 00 -00 — = = 
— +80 -76 18 “04 -O1 -00 — —- — - — 
- 75 3:29 *85 +20 “04 ‘Ol — — — — = 
—- -70 10°39 2-90 -73 “16 ‘03 -00 — — = — 
— 65 26°39 8-00 2-15 -50 “10 ‘OL | — — - — 
— 60 57-03 18:78 5:43 1:35 27 -O4 ‘00 — — = 
—- +00 108-73 38-90 12-11 3-21 ‘70 ‘12 -O] = — -- 
— +00 187-37 72-86 24:46 6-94 1-60 ‘28 -03 ‘00 — — 
— +45 296-98 | 125-54 45:49 13-84. 3°41 64 ‘08 OL — — 
— -40 438-38 | 201-54 78-91 25-79 6-79 1:35 18 ‘Ol —— — 
ah) 608-28 | 304-27 128-86 45-32 12-77 2-71 +39 -03 — —_— 
— -30 799-03 | 435-05 199-51 75:63 22-88 5-19 -79 ‘O07 a | — 
— +20 998-93 | 592-25 | 294-45 120-51 39-21 9-54 1:56 “14 00 — 
—- -20 1193-40 | 770-79 | 415-93 184:13 64:59 16:89 2-96 28 ‘OL — 
= 15 1366-56 | 961-93 | 564-06 | 270-60 102-59 28:93 5:45 -56 “02 oe 
= -10 1503-20 | 1153-66 | 736-02 | 383-39 157-51 48-04 9-76 1:07 ‘O04 —_ 
-— +05 1590-74 | 1331-66 | 925-52 | 524-53 | 234-18 77-52 17-08 2-03 -08 — 
00 1620-88 | 1480-70 | 1122-52 | 693-62 | 337-49 121-70 29-20 3:78 ‘16 — 
+ 06 1590-74 | 1586-45 | 1313-57 886-92 | 471-76 186-07 48°84 6-93 +33 — 
+ +10 1503-20 | 1637-37 | 1482-73 | 1096-42 | 639-60 | 277-05 80-00 12-50 66 -00 
+ Ld 1366-56 | 1626-51 | 1613-08 | 1309-42 | 840-60 | 401-73 128-32 22-24. 1:30 ‘O01 
+ -20 1193-40 | 1552-77 | 1688-90 | 1508-71 | 1069-70 | 566-77 201-53 39-05 2-57 ‘OL 
+ +25 998-93 | 1421-51 | 1698-12 | 1673-64 | 1315-58 | 776-85 | 309-60 67-65 5-04 -03 
+ +30 799-03 | 1244-13 | 1634-75 | 1782-33 | 1559-56 | 1032-16 | 464-51 115-58 9-90 -06 
+ +39 608-28 | 1036-82 | 1500-76 | 1815-10 | 1775-54 | 1325-12 | 679-05 194-50 19-43 | “15 
+ -40 438-38 | 818:35 | 1306-92 | 1758-51 | 1931-83 | 1636-61 963:78 | 321-76 38-08 +35 
+ 45 296-98 | 607-48 | 1072-08 | 1609-70 | 1995-55 | 1932-88 | 1321-43 | 521-51 74:55 *85 
+ +30 187-37 420-16 | 820-76 | 1379-49 | 1939-87 | 2165-06 | 1738-01 824-05 145-56 | 2-12 
+ +00 108-73 | 267-40 | 579-15 | 1093-22 | 1753-38 | 2274-19 | 2171-08 | 1259-96 282-59 5:48 
+ 60 57-03 153-91 370-23 787-57 | 1449-19 | 2204-82 | 2539-21 | 1843-14 542-34 | 14:73 
+ 66 26:39 78:18 | 209-24 | 503-43 | 1069-45 | 1928-16 | 2722-44 | 2535-20 1018-65 | 41-32 
+ 70 10-39 | 33:8] 100-85 | 275-47 | 680-14 | 1469-57 | 2591-04 | 3189-48 1839-04 | 121-27 
+ +75 3:29 11-76 39-19 122-01 352-63 | 924-51 2078-60 | 3505-34 3087-80 | 371-57 
+ -80 -76 3:00 11-17 39°80 135-72 | 437-70 | 1285-55 | 3100-93 | 4523-43 | 1169-00 
+ 85 ‘ll “47 1-96 8-01 32-49 130-90 | 516-43 | 1878-29 | 5046-26 | 3562-39 
+ 90 ‘Ol 02 “14 “67 3:25 16-63 91:09 | 533-33 3048-86 8581-50 
+ 95 -00 :00 -00 ‘O01 04 “25 | 1:99 20-38 | 303-89 | 6528-96 
+ 1:00 -00 -00 -00 -00 -00 ‘00 -00 ‘00 ‘00 -00 
| 
Mean -0000 -0973 +1947 -2924 +3906 +4894 +5890 -6896 ‘7915 *8948 
Mode “0000 *1165 +2316 *3442 -4531 -5576 +6569 *7509 °8394 9224 
o +2357 2337 2275 +2172 +2025 -1832 | -1590 -1294 | » -0937 ‘0511 
(1 - p2)(Vn =] -2357 +2333 2263 +2145 -1980 ‘1768 -1508 -1202 | -0849 ‘0448 
By -0000 -0154 -0626 -1446 -2672 -4400 -6789 1-0110 1-4866 2-2105 
By 2-7000 | 2-7272 | 2-8109 | 2-9573 | 3-1787 | 3-4958 | 3-9447 | 4:5897 | 5-5597 | 7-1586 


396 Distribution of Correlation Coefficient in Small Samples 


TABLE A. Ordinates and Constants of Frequency Curves. 


ry variate (correlation in sample). 


Ne, p variate (correlation in population sampled). 
OF) gel 2 3 -4 5 6 7 8 9 
— 1:00 00 00 -00 | 00 00 00 -00 -00 -00 -00 
255-95 | tae ee = pee als at 
=) WL, 00 | 00 = — — — = = = = 
- +865 -06 ‘O1 -00 — — = = = = == 
aus 47| +10 02 00:|| == = = = = — 
- 75 2-24 | D3 ‘11 | 02 00 = = = = a 
- -70 7-64 1-99 “46 | 10 02 -00 = = = = 
— 66 20-65 5°85 1-46 31 05 ‘Ol — = = == 
- -60 46-98 14-53 3-91 -90 17 02 “00 — — = 
— +06 93-51 31-56 9-20 2-26 “45 ‘O07 ‘01 = = = 
- 50 167-11 | 61:57] 19-43 5-14 1-09 17 02} — — — 
— 46 273:13 | 109-93 | 37-61 10-70 2-44 “42 05 00 — = 
- -40 413-76 | 181-99 | 67-57 20:74 5-06 *92 all ‘Ol — = 
- +35 586-81 | 282-18 | 113-83} 37-75 9:9) 1-93 +25 02 — = 
- +30 784-96 | 412:86 | 181-17) 65-03 18-39 3°84 53 “04. — = 
— +25 996-07 | 573-27 | 273-98 | 106-64 32-57 7:32 1-08 08 = = 
- :20 1204-17 | 758-68 | 395-39 | 167-20 55°31 13-42 2-13 18 00 = 
= Ol) 1391-40 | 960-11 | 546-33) 251-51 90-32 23°73 4-07 37 ‘Ol — 
- +10 1540-28 | 1164:55 | 724-46 | 363-84 | 142-24 40-60 7:55 ‘73 02 —_— 
- 05 1636-14 | 1356-03 | 923-47 | 507-05 | 216-43 67-37 13-64 1-44 05 = 
00 1669-24 | 1517-23 | 1132-65 | 681-41 | 318-54 | 108-54 24-06 2-78 -10 —— 
+ +05 1636-14 | 1631-71 | 1337-14 | 883-46 | 453-78 | 169-98 41-43, 5:27 21 a 
+ +10 1540-28 | 1686-21 | 1518-98 | 1104-85 | 625-68 | 258-79 69-76 9-84 “44 = 
Slo) 1391-40 | 1672-86 | 1658-98 | 1331-72 | 834-48 | 382-94] 114-86 18-07 90 00 
+ +20 1204-17 | 1590-74 | 1739-26 | 1544-87 | 1075-23 | 550-27 | 184-86 32-72 1-85 ‘01 
+ 26 996-07 | 1446-47 | 1746-32 | 1720-99 | 1335-82 | 766-63 | 290-54 58-39 3°77 02 
+ -30 784-96 | 1253-70 | 1673-93 | 1835-44 | 1595-60 | 1033-05 | 445-18 | 102-63 7-68 O04 
+ +38 586-81 | 1031-28 | 1525-25 | 1866-25 | 1825-32 | 1341-82 | 663-32 | 177-47 15-62 “09 
| “40 413-76 | 800-54 | 1313-68 | 1799-15 | 1989-34 | 1672-08 | 957-45 | 301-19 31-71 22 
| + -45 273:13 | 582-06 | 1061-54 | 1632-51 | 2051-11 | 1986-22 | 1331-67 | 499-96 64-26 57 
+ -30 167-11 | 392-47 | 796-88 | 1380-68 | 1981-92 | 2229-43 | 1771-29 | 807-35 | 129-80 1-50 
+ +36 93:51 | 242-15 | 548-35 | 1074-11 | 1771-83 | 2336-21 | 2229-32 | 1258-17 | 260-36 4-07 
+ -60 46-98 | 134-22 | 339-60 | 754-79 | 1439-68 | 2247-04 | 2614-58 | 1869-39 | 515-46 11-50 
+ 66 20-65 65-11 | 184-41 | 466-85 | 1036-50 | 1935-78 | 2793-79 | 2599-35 | 996-30 33-96 
+ 7 7:64 26-60 | 84:50} 244-62 636-66 | 1439-79 | 2627-94 | 3283-99 | 1843-93 | 105-05 
+ +75 2-24 8-62} 30-77 | 102-29 | 314-46 | 872-48 | 2059-10 | 3589-11 | 3154-22 | 339-37 
+ -80 “47 2-00 | 8:05 30-87 | 113-01 | 390-31 | 1221-83 | 3109-18 | 4657-18 | 1124-15 
+ +85 06 -28 | 1-25 5:56 24:47 | 106-93 | 457-32 | 1796-33 | 5131-57 | 3583-56 
+ -90 00 :00 | 08 “39 2-09 11-76 71-10 | 461-97 | 2931-90 | 8834-61 
+ 965 — — | 00 -00 02 13 1-19 13-97 | 244-23 | 6309-15 
+ 1:00 00 00 | -00 00 -00 00 -00 -00 -00 -00 
| | 
Mean 0000 | -0974 | -1950 | -2928 | -3911 | -4900 | -5896 | -6902 | -7919 | -8951 
Mode 0000 | -1154 | -2997 | -3415 | -4500 | -5543 | -6538 | -7482 | -8374 | -9213 
o 2294. 2274 -2214 2118 +1969 1780 +1543 1254 0907 0493 
(l- pi/Vn—1 2294 227) +2202 +2088 -1927 1721 -1468 “1170 -0826 -0436 
By 0000 -0148 -0600 1386 *2557 +4202 6464 9584 | 1-4001 | 2-0603 
By 2:7143 | 2-7406 | 2-8213 | 2-9623 | 3-1749 | 3:4783 | 3-9055 | 4:5131 | 5-4154 | 6-8681 


y variate (correlation in sample). 
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TABLE A—(continued). 


n= 21. p variate (correlation in population sampled). 
: : 
0 | 2 oil ae 5 ‘On Geert 8 9 
| : | ‘one i 

— 1-00 00 00 00 | 00. 00 00. 00 00 | 00 00 | 

— 90 00 00 = = — = = = = = 

= 85 03 ‘O01 00); — — = = = = = 

- -80 -29 06 01 | 00) — — = = = = 

- 75 1-52 “34 07 | ‘O01 00 = = = = = 

= 5-61 1:36 “29 05 ‘01 = = = = a 

- 65 16-13 4:27 “99 19 | 03 = = = = 

- 60 38°65 11-22 2-82 60 “10 — = = = 

- 66 80-30 25-56 6-97 | 1-59 | -29 00 

- 80 148-80 51-95 15-41 3°79 “74 ‘O01 — = = 

— 45 250-78 96-10 31-04 | 8-26 | 1-74 03 = = 

— -40 389-90 | 164-07 57:77 | 16-64 | 3-77 07 00 = = 

— 3d 565-17 | 261-27 | 100-39 | 31:39 7:67 “16 ‘O01 = = 

-— 30 769-89 | 391-17 | 164-25 55-82 14-76 “35 02 = == 

- +25 991-59 | 553-98 | 254:51) 94:20 27-02 ‘75 05 _ = 

- +20 1213-06 | 745-68 | 375-26 | 151-59 47-28 | 1-53 “| 00 — 

- +15 1414-38 | 956-73 | 528-29 | 233-39 79-38 | 3-04 24 | Ol) — 

- 10 1575-70 | 1173-63 | 711-91 | 344-73 | 128-23 5:83 50 OD) 
05 1680-10 | 1378-59 | 919-93 | 489-35 | 199-70 10-88 1-02 03 — 
00 1716-23 | 1552-13 | 1141-00 | 668-33 | 300-17 19-79 2-04 06 — 

+ -065 1680-10 | 1675-52 | 1358-90 | 878-59 | 435-79 35:09 4-01 “14 — 

+ -10 1575-70 | 1733-68 | 1553-58 | 1111-53 | 611-07 60-74 7133 || -29 — 

+ +15 1414-38 | 1717-73 | 1703-39 | 1352-20 | 827-05 102-64 14-66 | 63 — 

+ -20 1213-06 | 1626-97 | 1788-20 | 1579-31 | 1079-03 169-29 27-37 1:33 00 

+ +25 991-59 | 1469-48 | 1792-97 | 1766-81 | 1354-17 272-20 50-31] 2-82 ‘01 

+ 30 769-89 | 1261-29 | 1711-25 | 1887-05 | 1629-83 425-95 | 90-99 5-95 02 

+ +35 565-17 | 1024-10 | 1547-62 | 1915-72 | 1873-44 | 1356-52 | 646-90 | 161-66 12-53 06 

+ -40 389-90 | 781-85 | 1318-32 | 1837-75 | 2045-23 | 1705-56 | 949-63 | 281-48 26-36 ‘14 

+ -45 250-78 | 556-80 | 1049-40 | 1652-96 | 2104-80 | 2037-72 | 1339-82 | 478-52 | 55-31 “38 

+ -50 148-80 | 366-00 | 772-43 | 1379-62 | 2021-59 | 2291-99 | 1802-29 | 789-71 | 115-56 | 1-06 

+ +05 80-30 | 218-92 | 518-34 | 1053-63 | 1787-57 | 2396-03 | 2285-43 | 1254-35 | 239-50 | 3-02 | 

+ -60 38-65 | 116-85 | 311-00 | 722-19 | 1427-92 | 2286-37 | 2687-85 | 1892-96 | 489-12 | 8-96 

+ -65 16-13 54:13 | 162-26 | 432-22 | 1002-93 | 1940-29 | 2862-39 | 2660-83 | 972-87 27-86 

+ -70 5:61 20-90 70-68 | 216-87 | 595-00 | 1408-33 | 2661-07 | 3375-86 | 1845-86 90-85 

+ 75 1-52 6:31 24-12 85:61 | 279-97 | 822-04 | 2036-49 | 3668-97 | 3216-90 | 309-47 

+ -80 29 1:34 5-79 23-90 93:95 | 347-48 | 1159-40 | 3112-44 | 4787-20 | 1079-30 

+ -85 03 | “16 “80 3°86 18-41 | 87-22 | 404-32 | 1715-20 | 5209-97 | 3599-09 

+ +90 -00 00 “04 23 1-34 | 8-29 55:40 | 399-51 | 2814-91 | 9080-68 

+ -95 _— = -00 00 ‘01 07 “71 9-56 | 195-96 | 6087-03 

+ 1:00 00 | 00 00 00 00 -00 ‘00 00 | 00 00 

Mean 0000 0976 1952 2932 3916-4905 5902 6907 | -7924 8954. 

Mode 0000 1145 2279 3391 4472 5515-6509 | -7457 8354 9203 
o 2236 +2216 2157 2058 SIS UT/ 1732 1500 -1218 | -0880 0478 

(1 - p2)//Vn—1] +2236 2214 2147 “2085 -1878 “1677 1431 -1140 0805 0425 
1 -0000 0142 0577 1331 -2451  -4020 -6166 -9107 | 1:3227 | 1-9288 
Bo 2:7273 | 27527 | 2-8306 | 2-9666 | 3-1711 | 3-4617 | 3-8683 | 4-4432 | 5-2858 | 6-6169 

| 


ry variate (correlation in sample). 


398 Distribution of Correlation Coefficient in Small Samples 


I 
bo 


n 


TABLE A. Ordinates and Constants of Frequency Curves. 


p variate (correlation in population sampled). 


0 1 2 3 4 5 6 7 8 9 
— 1-00 00 00 00 00 00 00 00 00 00 00 
= 95 | -00 ss — a _— ass, = — = = 
— +90 | -O1 | es — ane = ea ee a — = 
— +85 02 00 00) = — — _ — = = 
- -80 18 033) -014 00; — _ = am 3 
- 5 1-03 21 04) 01 — _ — = = 
- -70 4-11 92 18| 03 00; — | — — = = 
- 65 12-59 3-11] —_-67 | 12 02 00; — = = a 
- -60 31-74 | 8-65 2-02 -40 06 ‘O11; — = = 5 
- +55 68:85 | 20-67 5-28 1-12 19 02 00; — = = 
— +50 132:30'| © 43-77 | 12:21 2:80 50) 07 01] — = = 
- -45 229-92 | 83:90| 25:58) 6-36 1-24 17 02) — = aa 
— -40 366-87 | 147-70 | 49-31] 13-34] 2-80 43 04 00; — = 
- +35 543-53 | 241-56| 88-41} 26-06] 5-93 ‘97 10 0) = 
- 30 754-00 | 370-08 | 148-69) 47-84| 11:83] 2-09 23 01); — = 
- -25 | 985-69 | 534:57| 236-09/ 83-10] 22:38] 4-29 52 03) = = 
- -20 | 1220-22 | 731-56 | 355-63 137-23] 40:36 | 8-43 1-10 07| = = 
- -15 | 1435-65 | 951-98 | 510-10) 216-26] 69-67| 15:89] 2-26 16 00; — 
- 10 1609-59 | 1181-06 | 698-57, 326-14] 115-44] 28-86) 4-49 34 Ol; — 
— 05 1722-72 | 1399-48 | 915-06 | 471-58 | 183-99] 50-64] 8-67 ‘72 02;  — 
00 1761-97 | 1585-51 | 1147-75 | 654-54 | 282-44] 85-93) 16-25 1-50 04) — 
+ 05 1722-72 | 1718-00 | 1379-01 | 872-46 | 417-89| 141-20] 29-67] 3-04 09 | — 
+ +10 1609-59 | 1779-88 | 1586-65 | 1116-63 | 595-93 | 224-73 | 52:80] 6-06 20) ees 
+ +15 1435-65 | 1761-23 | 1746-45 | 1370-99 | 818-50 | 346-33} 91-59] 11-87 43) — 
+ 20 1220-22 | 1661-60 | 1835-83 | 1612-18 | 1081-27 | 516-27 | 154-80 | 22-86 96 |  — 
+ +25 | 985-69 | 1490-67 | 1838-17 | 1811-21 | 1370-78 | 743-10 | 254-65 | 43-29) 2:10 00 
+ 30 754-00 | 1267-07 | 1746-86 | 1937-29 | 1662-37 | 1029-97 | 406-96 | 80-55) 4-60 01 
+ 35 543-53 | 1015-49 1568-03 | 1963-65 | 1920-03 | 1369-38 | 629-96 | 147-04 10-04 03 
+ -40 366-87 | 762-48 | 1321-06 | 1874-43 | 2099-64 | 1737-17 | 940-51 | 262-68 | 21-88 09 
+ 45 229-92 | 531-86 | 1035-89 | 1671-23 | 2156-76 | 2087-52 | 1346-06 | 457-35 | 47-54 26 
+ 50 132-30 | 340-82 | 747-64 | 1376-56 | 2059-06 | 2352-90 | 1831-17 | 771-34 | 102-73 ‘75 
+ 55 68-85 | 197-63 | 489-26 | 1032-04 | 1800-84 | 2453-82 | 2339-55 | 1248-74 | 219-99) 2-23 
+ -60 31-74 | 101-58 | 284-38 | 690-00 1414-20 | 2323-01 | 2759-17 | 1914-05 | 463-46} 6-97 
+ 65 1259 | 44-94 | 142-57 399-58 | 969-04 | 1942-00 | 2928-43 | 2719-83 | 948-62 | 22-83 
+ -70 411} 16:39 59-04 191-99 | 555-25 | 1375-57 | 2690-72 | 3465-29 1845-14) 78-45 
+ 75 1:03] 4:60} 18-88} 71-56 | 248-90 | 773-40 | 2011-22 | 3745-20 | 3276-10) 281-79 
+ +80 18 ‘89 4-16 -:18-48 | 78-00 | 308-91 | 1098-57 | 3111-21 | 4913-78 | 1034-74 
+ 85 02 10 51) 2-67 | 13:82} 71-03 | 356-95 | 1635-37 | 5281-97 | 3609-52 
+ 90 ‘01 00| 02} -13| +86] 584] 43-11] 345-00 | 2698-71 | 9320-25 
+ 95 ‘00; — 00 00 00 04 -42| 6-53 | 157-01 | 5864-34 
+ 1-00 00 00 00-00 -00 00 00 00 00 -00 
| =, 
Mean 0000 | -0977 | -1955 | -2935 | -3920 | -4910 | -5906 | -6912 | -7928 | -8956 
Mode 0000 | -1137 | -2264 | -3369 | -4447 | -5486 | -6484 | -7435 | -8339 | -9194 
o -2182 | -2162 | -2105 | -2007 | -1869 | -1688 | -1461 | -1185 | -0855 | -0464 
(1- p)/Vn—1] +2182 | +2160 | -2095 | -1986 | -1833 | -1637 | -1396 | -1113 | -0786 | -0415 
By 0000 | -0137 | -0555 | -1279 | -2354 | -3853 | -5893 | -8674 | 1-2532 | 1-8125 
2:7391 | 2-7630 | 2-8390 | 2-9703 | 3-1672 | 3-446] | 3-8328 | 4:3790 | 5-1687 | 6-3926 


2 
2 
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TABLE A—(continued). 


y variate (correlation in sample). 


n = 23. p variate (correlation in population sampled). 
0 7 2 3 4 5 6 7 8 9 
— 1:00 -00 00 | -00 00 | -00 -00 -00 -00 :00 :00 | 
— “95 -00 ate, = foes pont — —— es — pass 
=: 90 -00 a — a= =e — — — — er 
= 85 ‘Ol 00 = = — — -- — — — 
= HO ‘ll 02 = — — — — — — — 
75 70 13 E00 ir 2 eee he = =r = cs 
= 0 3:01 63 ‘Il 00 00; — — = — — 
= 65 9-81 2-26 “45 -08 ‘Ol = = = = = 
OO, 26-04 6-66 1-45 26 | ‘04 | 00 — 
= +99 58-96 16-69 3:99 78 | 12 | ‘OL — = = = 
=| -O0 117:47 36°83 9-65 2:06 | 34 | 04 00 a = = 
= offs 210-52 73-14 21-06 4-90 “88 | ‘11 ‘Ol a = = 
40) 344-75 132-79 42-04 10-68 2-08 +29 “02 — == == 
= :38 522-03 | 223-04 77:76 21-61 4:58 69 -06 -00 — — 
— -30 737-47 349-67 134-43 40-95 9-47 1-54 “15 ‘O01 — — 
= oesy 978-54 | 515-16) 218-71 73-21 18°51 3:28 36 02 -— — 
— -20 1225-82 | 716:90 | 336-58 124-07 34:41 6-67 ‘79 “04 — — 
= ole) 1455:33 946-00 | 491-89 200-13 61-07 12-98 1:68 ‘10 —- — 
=) 6) 1642-05 | 1186-97 | 684:57 | 308-15 103-79 24-29 | 3°46 | 23 -00 — 
05 1764-10 | 1418-82 | 909-02 | 453-86 169-29 43°81 | 6:89 “51 -O1 — 
00 1806-56 | 1617-48 | 1153-02 | 640-19 |} 265-41 76:30 | 13°33 1-10 -02 = 
+ 05 1764-10 | 1759-25 | 1397-58 | 865-25 | 400-21 128-44 25-06 2°30 ‘06 — 
aE IK) 1642-05 | 1824-91 | 1618-30 | 1120-27 | 580-40 | 208-99 | 45-84. 4:75 13 = 
aE og 5) 1455-33 | 1803-46 | 1788-25 | 1388-22 | 808-98 | 328-68 81-62 9-60 +30 — 
+ -20 1225-82 | 1694-75 | 1882-27 | 1643-56 | 1082-09 | 499-05 141-37 19:07 -69 a 
+ 1°25 978-54 | 1510-18 | 1882-05 | 1854-28 | 1385-76 | 730-12 | 237-92 37-20 | 1:56 00 
+ +30 737-47 | 1271-20 | 1780-86 | 1986-26 | 1693-34 | 1026-34 | 388-31 71-21 3°55 01 | 
Ey sys) 522-03 | 1005-63 | 1586-63 2010-13 | 1965-21 | 1380-56 | 612-67 133-58 | 8-04 -02 
+ -40 344-75 | 742-61 | 1322-06 | 1909-35 | 2152-67 | 1767-06 | 930-25 | 244-82 18:14 -06 
+ 45 210-52 | 507-37 | 1021-21 | 1687-50 | 2207-11 | 2135-75 | 1350-57 | 436-53 40-80 ia 
+ +50 117-47 | 316-95 | 722-70 | 1371-71 | 2094-49 | 2412-26 | 1858-09 | 752-42 91-21 +52 
TE sys) 58-96 178-18 | 461-21 | 1009-57 | 1811-83 | 2509-73 | 2391-84 | 1241-53 | 201-81 1:65 
+ :60 26-04 88-20 | 259-71 | 658-38 | 1398-78 | 2357-16 | 2828-70 | 1932-86 | 438-57 5-42 | 
+ 66 9-81 37°26 125:10 368-92 | 935-07 | 1941-17 | 2992-09 | 2776-53 | 923-77 18-68 | 
+ 70. 3:01 12°84 | 49-25] 169-74 | 517-49 | 1341-82 | 2717-16 | 3552-48 | 1842-02 67-66 | 
+ 75 -70 3:36 | 14-76 59:73 | 220-99 726-69 | 1983-68 | 3818-06 | 3332-08 | 256-26 | 
+ -80 ‘11 | -59 | 2-99 14-27 64-66 | 274-26 | 1039-58 | 3105-96 | 5037-18 | 990-75 | 
+ +85 ‘01 -06 “33 | 1:85 10:37 57-78 | 314-71 | 1557-24 | 5348-05 | 3615-31 | 
+ 90 -00 -00 ‘Ol ‘08 | +55 4:1] 33°50 | 297-55 | 2583-97 | 9553-82 
+ 95 -00 -00 00 | -00 | -00 “02 +25 4-46 125-64 | 5642-53 
+ 1:00 -00 | -00 -00 -00 | 00 | -00 00 -00 -00 -00 
Mean 0000 | -0978 -1957 +2938 +3923 4914 “5911 -6916 -7931 8958 | 
Mode -0000 | -1130 -2250 +3351 | -4424 +5462 6461 “7415 *8324 ‘9185 | 
o +2132 | -2113 +2056 -1960 -1825 +1647 +1425 “1155 0832 +0450 
(1 - p2y/Vn -1 +2132 2111 | -2047 ‘1940 | -1791 *1599 1364 ‘1087 | -0768 ‘04.05 
By -0000 -0132 *0535 +1232 +2264 +3698 *5645 *8279 | 1-1905 1-7092 
Bo 2-7500 | 2:7738 | 2-8467 | 2-9735 | 3-1633 | 3-4313 3°8024 | 4:3197 | 5-0623 | 6-1951 


r variate (correlation in sample). 


400 Distribution of Correlation Coefficient in Small Samples 


TABLE A. Ordinates and Constants of Frequency Curves. 


n = 24, p variate (correlation in population sampled). 
0 -] 2 3 4 5 6 7 -8 9 
— 1:00 -00 -00 00 -00 -00 -00 -00 -00 -00 -00 
= 95 asx are = | ae S==%, Paes pen) eet ¢ Pars. Ries 
a “90 -00 a pa — = sea —s eae pleas =i 
Ess Ol om 2a eee = = 2 at = =e 
BET) 07 -00 00) eo = = _ = = a mt 
er 48 -08 Ont a aes = ie = = me 
=e) 2-20 -43 07 | ‘OOM — —_ a — a 
— 65 7-63 1-65 30, 05 008 "ea = = = =: 
22-60 21-33 5-12) 1-04 | 17 -02 00} = ae — pa 
— 5 50-42] 13-46) 3-01 | 55 -08 -016|o) =o a — = 
-— -50 104:18 | 30-95 7-63 1-52 23 -03 00) = = = 
= 45 192-53 63-69 17:31 3-76 63 -08 | -O1 == == =e 
= 40) 323-58 | 119-24 35-80 8-54 1:54 20 “023 |e = te 
e385 500-79 | 205-69 68-31 17:90 3-53 -49 04 as — ue 
— -30 720-45 | 329-99 | 121-39) 35:02| 7-57 1:13 -10 00)| Sen a 
2-25 970-29 | 495-86 | 202:37| 64:42) 15-29 2-51 25 -O1 pee ie 
ey, 1229-99 | 701-70} 318-18 112-03 29-30 5-27 “57 -03 _ Ea 
~ +15 1473-52 | 938-94 | 473-77| 184-97) 53-46| 10-58 1-25 -07 ae eat 
ity) 1673-17 1191-49 670-05 | 290-81} 93-20) 20-41 2-66 16 00; — 
~ -05 | 1804-33 | 1436-72 901-95 436-29 | 155-59 | 37-86 5-48 36 -O1 ae: 
00 1850-07 | 1648-13. 1156-93 | 625-41 | 249-11| 67:67 | 10-92 | -80 O01 our 
+ +05 1804-33 | 1799-35 | 1414-71 | 857-07 | 382-82] 116:68| 21-14 1-74 -04 nas 
4+ +10 1673-17 | 1868-86 | 1648-62 | 1122-60 | 564-61 | 194-13 | 39-75 3-71 -09 a 
4 16 1473-52 | 1844-51 | 1828-88 | 1403-99 | 798-61 | 311-57 | 72:65| 7-76 -21 wee 
+ +20 1229-99 | 1726-50 | 1927-58 | 1673-57 | 1081-63 | 481-83 | 128-95 15:89 -49 ib 
4+ +25 970-29 | 1528-14 | 1924-68 | 1896-13 | 1399-26 | 716-52 | 222-03) 31-93 1G es ea 
a1) 720-45 | 1273-84 | 1813-38 | 2034-06 | 1722-83 | 1021-51 | 370-08 | 62-89 2-74 00 
35 500-79 | 994-68 | 1603-54 | 2055-28 | 2009-07 | 1390-19 | 595-15 | 121-20 6-42 Ol 
+ -40 323-58 | 722-40 | 1321-50 | 1942-61 | 2204-43 | 1795-33 | 919-02) 227-90] 15-02 -04 
+ 45 192-53 ° 483-43 | 1005-54 | 1701-90 | 2255-95 | 2182-51 | 1353-49 | 416-:18| 34-98 -12 
+ -50 104-18 294-40) 697-76 | 1365-25 2128-00 | 2470-20 | 1883-18 733-09 | 80-88 37 
4+ +55 50-42 | 160-45 | 434-25 | 986-41 | 1820-73 | 2563-87 | 2442-41 | 1232-91 184-91 1-22 
+ -60 21:33) 76-48 | 236-89 627-46 | 1381-90 | 2388-99 | 2896-56 | 1949-56 | 414-53 4:21 
+ 65 7-63| 30-86 | 109-64 340-21 | 901-23 | 1938-06 | 3053-53. | 2831-08 | 898-52] 15-27 
+ +70 2:20| 10:05) 41-03 | 149-89 | 481-72 | 1307-36 | 2740-64 | 3637-57 | 1836-75 | 58-29 
ey fs) | -48 2-45) 11-52 49-80) 195-98 | 682-00 | 1954-22 | 3887-76 | 3385-03 | 232-77 
2280 -07 39} 214) 11:00, 53:55 | 243-21 | 982-60 | 3097-07 | 5157-63 | 947-52 
+ 85 -01 | 03 21| 1:28) 7-76 | 46-94 277-15 | 1481-10 | 5408-62 | 3616-87 
+ -90 -00 00 | Ol | 04 | +35 2-89 | 26-01 256-32 | 2471-22 | 9781-81 
95 -00 | 00 | 00 | -00 -00 -O1 “15 3-04 | 100-42 | 5422-78 
+ 1:00 00 -00 | 00 | -00 -00 -00 -00 -00 -00 -00 
| | =: = 
Mean 0000 | -0979 —--1959 | -2941 | +3927. -4918 | -5915 | -6920 | -7934 | -8960 
Mode 0000 | -1124 | -2238 | -3334 | -4404 | -5441 | -6440 | -7397 | -8310| -9178 
o 2085 | -2067 | -2011 | -1916 | -1783 | -1609 1391 | -1127 | -0811 | -0438 
(1 - p)/Vn—1) -2085 | -2064 | -2002 | -1897 | -1752 | -1564 | -1334 | -1063 | -0751 | -0396 
8, 0000 | -0127 | -0516 | -1187 | -2180 | -3557 | -5419 | -7918 | 1:1335 | 1-6167 
By 2:7600 | 2°7826 | 2-8537 | 2-9764 | 38-1596 | 3-4174 | 3-7774 | 4-2653 | 4:9654 | 6-0161 


y variate (correlation in sample). 
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TABLE A—(continued). 


n = 25. p variate (correlation in population sampled). 
0 all 2 “3 “4 mo) 6 arf ‘8 WG) 
- 1:00 00 00 00 00 00 00 00 00 00 00 | 
BS .95 ae a = =F na = =a =e - yas 
2-00 = aan = cs a ne es ee eee pie 
tees 5 00 ae es es = 
er .60 04 (Op = ee ae = = Se ees = 
a HD 32 05 00 ae 2 ue ae 
=) 270 1-61 29 05 008 | += = = =e = = 
- 65 5-93 1-20 21 03 | -00 — — — — — 
— -60 17-46 3°93 “75 12) ‘01 — — — — -—= 
- +06 43-08 10-85 2-27 38 05 00 = a = = 
— :50 92-30 25-99 6-01 111 “16 ‘01 = = ee 
— 45 175-88 55-40 14-22 2-89 “44 05 -00 = —- | — 
— -40 303-38 | 106-96 30-45 6-82 1-14 13 01 
- 35 479-90 | 189-49 59-94. 14-80 2-72 “34 03 — — — 
— :30 703-06 | 311-08 | 109-50 29-91 6-05 83 07 00; — — 
— 25 961-07 | 476-77 | 187-04 56-62 12-62 1-80 Sz) ‘01 — me 
- 20 1232-83 | 686-08 | 300-46 | 101-06 24-92 4-15 “41 02 — — 
— 16 1490-32 | 930-93 | 455-82 | 170-78 46-75 8-62 “93 “04 — — 
- +10 1703-04 | 1194-73 | 655-13 | 274-14 83-60 17-14 2-04 =alsf — — 
- 05 1843-49 | 1453-28 | 893-96 | 418-94 | 142-84 32-68 4-35 *25 00 = 
00 1892-58 | 1677-56 | 1159-60 | 610-31 | 233-56 59-95 8-94 “59 ‘Ol = 
+ 05 1843-49 | 1838-37 | 1430-51 | 848-05 | 365-78 | 105-89 17-81 1-32 02 — 
+ -10 1703-04 | 1911-80 | 1677-69 | 1123-71 | 548-65] 180-12 34-44 2:90 06 — 
+ +16 1490-32 | 1884-46 | 1868-40 | 1418-41 | 787-53) 295-02 64-59 6-26 “14 — 
+ -20 1232-83 | 1756-95 | 1971-86 | 1702-29 | 1080-00 | 464-70 | 117-49 13-22 “35 = 
+ +26 961-07 | 1544-63 | 1966-16 | 1936-84 | 1411-36 | 702-41 | 206-98 27:37 “86 — 
+ -30 703-06 | 1275-10 | 1844-50 | 2080-76 | 1750-96 | 1015-61 | 352-32 55:47 2-11 -00 
+ 35 479-90 | 982-79 | 1618-89 | 2099-17 | 2051-69 | 1398-37 | 577-51 | 109-85 | 5-13 ‘01 
+ -40 303-38 | 701-99 | 1319-51 | 1974-33 | 2255-00 | 1822-10 | 906-95 | 211-92 12-43 02 
+ 40 175-88 | 460-12 | 989-05 | 1714-58 | 2303-40 | 2227-89 | 1354-96 | 396-34 29-95 -08 
+ -60 92:30 | 273-17 | 672-95 | 1357-37 | 2159-72 | 2526-88 | 1906-56 | 713-49 71-65 26 
+ -65 43-08 | 144-33 | 408-43 | 962-75 | 1827-71 | 2616-37 | 2491-37 | 1223-04 169-24 “90 
+ -60 17-46 66-25 | 215-85 | 597-36 | 1363-75 | 2418-65 | 2962-87 | 1964-30 | 391-39 3°26 
+ 66 5:93 25:53 95:99 | 313-40 | 867-68 | 1932-87 | 3112-90 | 2883-60 | 873-02 12-46 
‘70 1-61 7:85 34-15 | 132-22 | 447-95 | 1272-41 | 2761-35 | 3720-72 | 1829-54 50-16 
t 6°75 “32 1-78 8-99 41-47 173-61 | 639-37 | 1923-13 | 3954-50 | 3435-16 | 211-20 
+ -80 04 -26 1-53 8-48 44-30 | 215-45 | 927-75 | 3084-92 | 5275-33 | 905-21 
+ 89 -00 02 13 88 5:81 38:09 | 243-81 | 1407-18 | 5464-05 | 3614-59 
+ -90 = -00 ‘00 -00 “22 2:03 20-16 | 220-57 2360-87 10004-61 
+ .-95 = = = = -00 ‘Ol 09 2-07 80-18 | 5206-06 
+ 1-00 00 -00 00 -00 -00 -00 ‘00 -00 -00 ‘00 
Mean 0000 0980 -1960 2943 -3930 4921 5918 6923 | -7937 8962 
Mode -0000 ‘1118 2227 3318 4385 5421 -6420 -7380 | 8297 ‘9170 
o “2041 +2023 1968 1875 1744 1573 1359 -1100 ‘0791 0427 
(l= p2)/Vn—1| -2041 2021 1960 1858 1715 1531 1306 1041 | 0735 0388 
By -0000 0123 0499 1146 2102 3423 5203 ‘7586 | 1-0816 | 1-5334 
Boy 2-7692 | 2-7916 | 2-8601 | 2-9788 | 3-1559 | 3-4042 | 38-7453 | 4-2149 | 4:8769 | 5-8584 


402 Distribution of Correlation Coefficient in Small Samples 


TABLE A. Ordinates and Constants of Frequency Curves. 


rv variate (correlation in sample). 


n = 50. p variate (correlation in population sampled). 
| 
0 “J 2 3 “4 5 | 6 7, 8 9 
See ae ses ke | 

~ 1-00 00 | 00 00 00 00 -00 | -00 00 -00 67 2 
| <2 95 | coe >| eS S|) = 68 3 
90 _ = = ae = ha 69 6 
Th as a = — = = 70 1-0 
2800 — eu = = = =e as a 71 fle 
45 Sf oe a = = 72 2-9 
70 “O0i) = — -_ = Bee eal eee a a 73 5 
65 | “Ole | a ee ae = 74 8-8 
60 | -10 | “O08 c= = eee ee ae | 75 15-3 
55 | -69 | -04 | 00)" = = ae 76 26-7 
-50| 3-68 27 | ‘OL | DN rh 9 ||P = |e 46-4 
45 = 15-10 1-40 -08 00 | — — |©@ -78 80-4 
-40| 49-85 5-82 | 44 | 02 | | | | SBtey er 168-6 
-35| 136-13) 20-06) 1-88 0A ere 00) | ae = = — | -s0| 238-4 
30) 314-21] 58:57) 6-85 48 ‘021 = sa = || |-oe87 406-0 
.25 | 623-17 | 147-07 | 21-50 1-85 008 = me st -82| 683-7 
20 | 1075-24 | 321-70| 59-02 6-32 36 O01" == a mee as ep ouel 
-15 | 1629-13 | 618-58 14288 19-11 1-35 040 | = sath =, 584918482 
-10 | 2182-12 | 1052-79 | 307-18| 51-61) 4-55 -18 “00)\ ae | |S 85) 9086-7 
-05 | 2595-77 | 1593-19 | 589-27 | 125-03) 13-87 68 (Yl) — |S -86) 4519-5 
-00 | 2749-60 | 2149-47 | 1011-38 | 272-76 | 38-38 2-39 40500) oo — |& -87| 6673-8 
-05 | 2595-77 | 2587-70 | 1554-59 | 535:97| 96:53} 7:70 -20 00| — | 6488) <o34041 
-10 | 2182-12 | 2777-44 | 2138-36 | 948-41 | 220-60) 22-82 78 -O1| — | 289 | 19187-5 
-15 | 1629-13 | 2650-80 | 2625-45 | 1507-84 | 457-26 | 62-11] 2-82 | 02) — | @ -90 | 14502-0 
.20 | 1075-24 | 2239-38 | 2864-46 | 2144-80 | 856-50 | 154-86 9-46 ‘ll| — | @ +97 | 15261-3 
-25 | 623-17 | 1663-32 | 2758-76 | 2712-27 | 1441-46 | 351-85 29-47 48) — |'& -92 | 13599-9 
-30 | 314-21 1076-19 | 2323-98 | 3022-24 2161-66 | 723-72 84-89 1-99 | 00 | © -93 96306 
-35 | 136-13 | 599-07 | 1691-43 | 2932-02 | 2856-13 | 1333-93 | 224-22 7-75 | 02 | . -94| 4918-7 
40 49-85 | 282-26 1046-45 | 2437-43 | 3274-68 | 2172-97 | 536-92 | 28-37 | 09 95 | 1550-4 
45 15:10 110-16) 538-66 | 1700-04 | 3192-62 | 3070-40 | 1147-17 | 96-36 | 51 96 | 230-3 
50 3-68 | 34-61 | 224-22 | 967-26 2575-61 | 3668-50 | 2138-79 | 298-71, 2-85 97 9-6 
55 69 8-42 72-63 | 432-21 | 1656-86 3578-51. 3372-39 | 824-60) 15-26 -98 3 
-60 | 10 151 | 17:37 | 143-93 | 807-35 | 2713-26 | 4300-25 | 1954-75 | 76-77 99 0 

65 01 | 18 284 33:16 276-92 1489-70 4152-55 | 3764-57 | 350-56 

70 00 01 29) 4-73] 59:97 | 532-70 | 2747-94 | 5398-21 | 1367-63 

ih 00 01 35 6-91 | 104-96 | 1061-91 | 4990-96 | 4092-05 

Ly hy en 00 O1 | 32 8-58 181-98 | 2306-06 | 7650-44 

“85 | -00 -00 al 8:16 | 322-77 | 5817-30 

| = a = 00 | 03 4-26 | 621-81 

95 | — ast Se | -00 -00 24 

1-00 00 00 -00 -00 00 00 | -00 -00 00 


For the constants of the curves for »=50: see p. 372 above. 
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TABLE A—(continued). 


m = 100. p variate (correlation in population sampled). 
0 -] 2 3 A 5) 6 7 | ‘8 9 (normal 
curve)* 
7 

- 1-00 00 00 | 00 00 00 00 00 00 00 | 75 1 ‘0 

- 95) — — — — — — 76 2) — 

- 90); — _ — — — — 7 5) — 

- 85) — — -- — -- oS — = = 78 14] — 

=- 80); — — _ — ~- — ~- _- — |. :79 4-0 ‘0 

- 75 | — — — -- — |2@ -80 11:3 2 

- -70| — _ — — ~- -- — — | — | & -81 31:3 3 | 

= 65; — ~ — — — | = Be =82 84-7 3-2 

= 60; — — — — a = = —. | % -83| 221-8 25:3 | 

— 05 — ee — | — = = — | — |aq -84 556-2 150-2 
=| - -50 00; — | — — = |) = [22 85 | 1820-0) .678-6 
3 | - -45 08 00; — -- ~ — — — — | § -86) 2919-0] 2330-1 | 
7 | - -40 ‘91 ‘Ol | = = = — |S -87| 5895-9} 6082-6 
3 | - -35 7-43 15) 003}, =, —— = 2 = — (& -88  10597-3 | 11944-8 

- -30| 42-60 1-41 | 02a — | — | &  -89 | 16373-8 | 182113 | 
a} - -25| 177-84 9:50 18 00) — iil C= = = | S +90 | 20754-4 | 20887-0 
4 | - -20| 555-18] 48:00) 1-55 02| — = = — | — |~  -9f | 20233-5 | 18211-3 
2 | — -15 | 1321-36 | 185-37| 9-53 16 2008 eee eye — | — |g -92 | 13848-4 | 11944:8 | 
3 | — -10 | 2431-67 | 554-86) 45-84 1-24 | ‘Ole = — | — |-@ -93| 5823-1} 6082-6 
D | — -05 | 3493-29 | 1299-62 | 173-78 7-57 | ‘OOM = = = ee — | — |'& -94| 1227-8 | 2330-1 
3 -00 | 3989-27 | 2395-29 | 522-21 | 36-98 | ‘70. (0) = = | > 95) 93-7| 678-6 
3 | + -05 | 3493-29 | 3480-22 1246-23] 14532) 456 03) = ==) 96) 55) 150-2 
~ | + -10 | 2431-67 | 3979-11 | 2358-16 | 458-62) 24-21 | one y= — | = 97| — | 258 
3 | + -15 | 1321-36 | 3560-42 3519-22] 1156-81 | 104-66 1-87 | 00) — = 98| — | 3-2 | 
S| + -20 | 555-18 | 2469-56 | 4103-61 | 2311-65 | 365-76 | 11-67 Ol. — | 99| — | 3 
S| + -25 | 177-84 | 1309-30 | 3687-36 | 3611-32 | 1021-01) 59:96] -41) — = £00) 2 
| + 30 | 42-60 | 520-41 | 2504-77 | 4329-36 | 2237-43 | 249-60 3-35 ‘00 — 
~| + +35 7-43 | 151-11 1253-64 | 3884-32 | 3759-08 | 824-50 22-96 03 = 

+ -40| 91 | 30-98 | 446-88 | 2522-86 4690-55 | 2099-42 | 127-86 351) == 

+ 45 08 4:29 108-49 | 1125-08 | 4166-27 | 3962-12 | 558-78 3-87 — | 

+ +60 00 | -38| 16-90 | 333-60 | 2488-44 | 5254-02 | 1829-25] 35-59 | 00 

+ $5) — 02) 156 | 59-16 | 925-05 | 4549-13 | 4200-10 | 254-84 02 

+60) -— 00 08 5:68 | 192-25 | 2320-57 | 6157-92 | 1316-29 2-01 | 

+ 65) — — | ‘00 25 19:17] 601-45 | 5023-96 | 4363-45 38-45 | 

+ i = | = 00 ‘73 | 63-47 | 1850-41 | 7728-87 | 519-97 | 

+ -75| —- = |S — ‘Ol| 1-92 | 220-19 | 5408-23 | 3951-47 | 

+ -80| — hs laeeeae — | 00 ‘Ol 4:76 | 876-76  10951-32_ 

+ 85 | — — 00 Ol) 11-56 4488-65 | | 

+ 90) — _ — | 00 00) 29-37 | 

+ 95 = — ==") -00 

+1-00 00 00 00 00 00 00 | 00 00 | 00 


* These ordinates indicate how poor is the approximation of a normal curve to the frequency of r, 
where » is 100, but p is large. 


For the constants of the curves for n= 100: see p. 372 above. 
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TABLE B*. To assist the calculation of the Ordinates of the Correlation Frequency Curves 
from Expansion Formulae. 


log Naee, ra 
Vn—1 log (1—p’)=0. 
n | log ee log Nate n tog int ] ] | 
mei | al Ss 270) 08 X2 pi be fs pa 


Teta | 42 | 7956681} 81 | -9460821 | —-95 | -5054977 | 2-8825969 | | 
-0624694 | 43 | -8011592] S82 | -9488475 | —-90 | 3606232 | 1-3172203 | 

15 
Re 


3 

4 

5 | -1760913 § 44 | -8065151 | 83 | -9515781 | —-85 | -2783685 639844 | 
6 | -2525750 | 45 | -8117421} 84 | -9542748 | —-80 | -2218487 7335437 
7 | -3098944 | 46 | -8168464} $5 | -9569384 | —-75 | -1795110 | 1-8605570 | 
8 | -3556022 § 47 | -8218336 — 86 | -9595698 | —-70 | -1462149 | 1-9604452 
9 | -3935530 | 48 | -8267089} S87 | -9621697 | —-65 | -1192240 | -0414179 
10 °8314772 | S8 | -9647388 | —-60 | 0969100 | -1083599 


-4259687 


i 
Ss 


| 11 | -4542425 | 50 | -8361432 | S89 | -9672779 | —-55 | -0782279 | 1644063 | | 
12 | -4793037 | 51 | 8407111 | 90 | -9697877 | —-50 | 0624694 | -2116818 | | 
13 | ~ -5018021 | 52 | -8451849 | 97 | -9722688 | —-45 | -0491347 | -2516860 | 
14 | -5222096 | 53 | -8495685 | 92 | -9747218 | —-40 | -0378604 | -2855089 | 
15 | -5408793 | 54 | 8538654 | 93 | -9771475 | —-35 | -0283764 | -3139606 | 
16 | -5580824 | 55 | -8580790 | 94 | -9795464 | —-30 | -0204793 | -3376520 
17 | -5740313 | 56 | 8622124 | 95 | -9819190 } —-25 | 0140144 | -3570469 
18 | -5888955 | 57 | -8662687 | 96 | -9842661 | —-20 | 0088644 | -3724968 


u a 
19 | -6028127 } 58 | -8702506 | 97 | -9865880 f —-75 | -0049416 | -3842651 | | a 78 
20 | -6158957 | 59 | 8741609] 98 | -9888854 ] —-70 | -0021824 | -3925427 | te = S 
21 | -6282386 | 60 | -8780020} 99 | -9911587} —-05 | 0005435 | -3974593 | N 3 eS 
22 | -6399203 f 67 | 8817764 | 100 | -9934085 00 |:0 | -3990899 | & | 8 i ; 
23 | -6510080 ] 62 | -8854863 | 200 | 1-2993966 | +-05 | 0005435 3974593 | i | i i i 
24 | -6615588 | 63 | -8891340 +-10 | 0021824 | -3925427) —« | w» = = 
25 | -6716222 | 64 | -8927214 4-15 | -0049416 | -3842651) @& | © = 
| 26 | -6812412 ] 65 | -8962506 +-20 | 0088644 | -3724968 | 
| 27 | -6904533 | 66 | -8997233 +:25 | -0140144 | -3570469 
28 | -6992915 | 67 | -9031414 +:30 | -0204793 | -3376520 
29 | -7077847 | 68 | -9065065 | +:35 | -0283764 | -3139606 
30 -7159590 F 69 | -9098203 +-40 | -0378604 | -2855089 
31 | 7238374 | 70 | -9130844 +:45 | 0491347 | -2516860 
32 | -7314404 | 77 -9163001 +:50 | -0624694 | -2116818 
33 -7387867] 72 | -9194689 +:55 | 0782279 | -1644063 | 
34 -7458930 4 73 | -9225921 +-60 | -0969100 | -1083599 
35 7527745 | 74 | -9256711 +-65 | 1192240 | -0414179 
36 -7594449 § 75 | -9287070 | +-70 | -1462149 | 1-9604452 
| 37 | -7659168 | 76 | -9317011 +:75 | -1795110 | 1-8605570 | 
| 38 | -7722016 ; 77 | 9346545 +89 | 2218487 | 1-7335437 | 
, 89 | -7783099 ; 78 | -9375682 | +°85 | -2783685 | 1-5639844 
' 40 | 7842513 79  -9404434 | +90 | 3606232 | 1-3172203 
41 | +7900346 - SO | 943281] +:95 | -56054977 | 2-8825969 
| i 
* Tf the ordinate at r be y, then (see p. 348 
a7, an ps 3 ; py 
va (14 Men Gale (isle : att) 
_9 Z 
where log Y=log = + log (1 —p?)2 —(n -1) log x, — log x». 
nN — 


All these quantities are given for r= -—-95 to +-95 and p=0-0 to 0:9 in this Table B. 
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TABLE B. Yo assist the calculation of the Ordinates of the Correlation Frequency Curves 
from Expansion Formulae. 


p='l. p="2. 
log (1—p Daal -9934528. log (1 — p?)? =1-9734068. 
r log x1 log x2 dg. | de ps | fa r | log x1 log x2 pi 


| | | | 


| =-95  -5470942 | 1-8563426  -238125  -0229783 | —-0364766 —-0013075 | —-95  -5899090 | 1-8182302 | -22625 


| = -90  -4002321 | 1-2919599  -23875  -0233820 | — 0366462 — -0017344 | —-90 | -4413696 | 1-2546862 | -2275 
~-85 | -3159806 | 1-5397223 -239375  -0237893 | --0368123  —-0021666 | --85 | -3554188 | 1-5032983 | -22875 

| --80 | 2574549 | 1-7102846 | -24 0242 | ~ 0369750 — 0026040 | —-80 | -2951711 | 1-6747215 | -23 

| —-75 | 2131018 | 1-8383056 | -240625 -0246143 | —-0371342 | —-0030466 | —-75 | -2490732 | 1-8036150 | -23125 

| --70 | 1777811 | 1-9392061 | -24125  -0250320 | —-0372898 | —-0034943 | --70 | 2119841 | 1-9053996 | -2325 


| —-60 | -1243983 | -0891598” -2425 


| 4-15 | 0005603 | -3809998 | -251875 | 0326721 | —-0393354  —-0118089 | +-15 | 0005777 | -3642861 | -25375 
| +:20 | 0022729 


+:40 | 0223140 | -2878260 -255 0351125 | —-0396984 | —-0144700 | +-40 | -0105126 | -2770218 | -26 
+:45 | -0313204 | -2551371  -255625  -0356111 | —-0397568 | —-0150118 | +-45 | 0170404 | -2455721 | -26125 
+:40 | 0423754 | -2162728  -25625 | -0361133 | —-0898102 | —-0155564 | +-50 | -0255763 | -2079674 | -2625 

| +:55 | -055842]  -1701431 | -256875 | -0366189 | —-0398587 | —-0161038 | +-55 | -0364823 | -1631181 | -26375 
+-60 | -0722203 | °1152488 -2575 | -03871281 | —-0399021 | —-0166540 | +-60 | -0502571 | -1095254 | -265 

| +-65 | 0922180 | _-0494649 | -258125 | -0376408 | — -0399406 | —-0172067 | +-65 | -0676076 | _-0450641 | -26625 
+70 | -1168808  1-9696565 | -25875 | -0881570 | - 0399739 | —-O0177618 | +-79 | -0895777 | 1-9666028 | -2675 
+:75 | 1478351  1-8709390 | -259375 | -0386768 | —-0400021 | —-0183193 | +-75 | -1177943 | 18692544 | -26875 
+:80 | -1878190 | 1-7451026 | -26 “0392 — -0400250 | —-0188790 | +-8O | -1549924 | 1-7448109 | -27 
+:85 | -2419720  1-5767267  -260625 | -0397268 | —-0400427 | —-0194408 | +-85 | -2063110 | 1-5778522 | -27125 


| +-90 | 3218470 | 1-3311524 | -26125 | -0402570 | — -0400550 
| 4-95 | 4643287 2-8977254 -26187! 


1487560 | -0211959 | -241875  -0254533 | —-0374417 | —-0039470 | —-65 | -1811668 | 1-9882856 | 23375 
-0258781 | —-0375900 | —-0044047 | —-60 | -1549924 | -0571577 | -235 
—-55 | 1036628 | -1462328 | -243125 | -0263064 | —-0377346 | —-0048672 | —-55 | -1324153 | -1151516 | -23625 
—-50 | 0858411 | -1945400 | -24375 | -0267383 | —-0378754 | —-0053345 | —-50 | -1127264 | -1643923 | -2375 
—-45 | -0704333 | -2355806 | -244375 | -0271736 | —-0380123 | —-0058066 | —-45 | -0954255 | -2063796  -23875 | 
—-40 | 0570761 | -2704450 | -245 0276125 | —-0381453 | —-0062833 | —-40 | 0801485 | -2422038 . -24 

— +35 | 0454992 | -2999432  -245625 | -0280549 | —-0382744 | —-0067646 | —-35 | -0666246 | -2726756 , -24125 
—-30 | -0354989 | -3246862  -24625 | -0285008 | — -0383994 | —-0072503 | —-30 | -0546496 | -2984059 , -2425 
—:25 | 0269206 | -3451377  -246875 | -0289502 | —-0385204 | —-0077405 | —-25 | -0440680 | -3198590 | -24375 | 
—-20 | -0196470 | -3616495 | -2475 ~ | -0294031 | —-0386373 | —-0082349 | —-20 | -0347621 | -3373870 | -245 || 
—-15 | 0135901 | -3744849 | -248125 | -0298596 | --0387500 0087336 | —-15 | 0266432 | -3512533 | -24625 


| 
> 
on 


| 


—-10 | 0086862 | -3838348 -24875 | -0303195 | —-0388585 | —-0092363 | —-70 | -0196470 | -3616495 | -2475 | 
—-05 | 0048920 | -3898291 | -249375 | 0307830 | —-0389627 | --0097431 | —-05 | -0137293 | -3687055 | -24875 | 
00 | -0021824 | -3925427 | -25 | 03125 — -0390625 | —-0102539 00 | -0088644 | -3724968 | -25 
+:05 | 0005490 | -3920005  -250625 | -0317205 | —-0391579 | —-O107685 | +-05 | -0050431 | -3730485 | -25125 
+-10 | -0 | -3881779 | -25125 | -0321945 | —-0392490 | — 0112869 | +-70 | 0022729 | -3703365 | -2525 


3703365 | +2525 | 0331531 | —-0394174 | —-0123344 | +-20 | -0 +3547680 | -255 

5 | 0052014 | -3559973  -253125 | 0336377 | —-0394947 | —-0128634 | +-25 | 0006024 | -3415919 | -25625 
-30 | -0094334 | -3377189 -25375 | -0341258 | —-0395674 | —-0133957 | +-30 | -0024715 | -3244949 | -2575 
+35 | -0150862 | -3151498  -254375 | -0846174 | —-0396353 | —-0139313 | +-35 | 0057238 | -3031260 | -25875 


-0200045 | +-90 | -2833014 | 1-3337203 | -2725 
-0407908 | —-0400620 | —-0205702 | +-95 | -4228471 | 2-9017612 | -27375 


oa 
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TABLE B—(continued). 


P =*s4 p = 
log (1 — p2)? =1-9734068. log (1 — p2)? =1-93856 
pe ps pa aa log x1 log x2 pi 2 ps pa 
= _ -| a _ = 
0159758 | —-0326811 | +-0057193 | —-95 | -6348801 | 2-7667074 | -214375 | 0102424 -0280527 | +-0104960 
-0166531 | —-0331264 | +-0050816 | —-90 | -4849062 | 1-2038806 | -21625 | 0110633 ‘0288235 | +-0098985 
0173445 | —-0335620 | +-0044192 ]| —-85 | -3974915 | 1-4532246 | -218125 | -0119158 -0295824 -0092418 
‘01805 —-0339875 | +-0037321 | --8O | -3357497 | 1-6253949 | -22 -0128 -0303250 | +-0085260 
-0187695 | —+0344025 | -+-0030208 | — 745 | -2881264 | 1-7550511 | -221875 | -0187158 ‘0310528 | +-0077516 
0195031 | —-0348065 | +-0022856 | —-70 | -2494796 | 1-8576146 | -22375 | -0146633 -0317633 | +-0069189 
*0202508 | —-0351990 | +-0015269 | —-65 | -2170712 | 1-9412961 | -225625 | -0156424 -0324549 | +-0060287 
‘0210125 | —-0355797 | +-0007451 | —-60 | -1892713 | -0109810 | -2275 -0166531 -0331264 | +-0050816 
-0217883 | —-0359481 | —-0000593 } —-55 | -1650331 | -0698054 | -229375 | -0176955 -0337760 | +-0040787 
-0225781 | —-0363037 | —-0008860 } —-50 | -1436465 | -1198950 | -23125 | -0187695 ‘0344025 | +-0030208 
-0233820 | —-0366462 | —-0017344 | —-45 | -1246098 | -1627501 | -233125 | -0198752 -0350042 | +-0019092 
-0242 — 0369750 | —-0026040 | —-40 | -1075577 | -1994619 | -235 ‘0210125 -0355797 | +-0007451 
-0250320 | ---0372898 | —-0034943 | —-35 | -0922180 | -2308416 | -236875 | -0221814 -0361275 | —-C004699 
-0258781 | —-0375900 | —-0044047 | —-30 | -0783851 | -2575009 | -23875 | -0233820 -0366462 | —-0017344 
-0267383 | —-0378754 | —-0053345 | —-25 | -0659021 -2799047 | -240625 | -0246143 ‘0371342 | —-0030466 
-0276125 | —-0381453 | —-0062833 }| —-20 | -0546496 | -2984059 | -2425 -0258781 ‘0375900 | — -0044047 
‘0285008 | — -0383994 | —-0072503 | —-15 | -0445372 | -3132690 | -244375 | -0271736 ‘0380123 | —-0058066 
‘0294031 | —-0386373 | —-0082349 | —-0 | -0354989 | -3246862 | -24625 | -0285008 -0383994 | —-0072503 
-0303195 | —-0388585 | —-0092363 }| —-05 | -0274889 | -3327884 | -248125 | -0298596 ‘0387500 | —-0087336 
-03125 — -0390625 | —-0102539 -00 | -0204793 *3376520 | +25 -03125 -0390625 | —-0102539 
‘0321945 | —-0392490 | —-0112869 | +-05 | -0144591 -3393033 | -251875 | -0326721 -0393354 | —-0118089 
‘0331531 | —-0394170 | —-0123344 | +-20 | -0094334 | -3377189 | -25375 | -0841258 -0395674 | —-0133957 
‘0341258 | —-0395674 | —-0133957 | +-75 | -0054243 3328255 | -255625 | -0356111 ‘0397568 | —-0150118 
‘0351125 | —-0396984 | —-0144700 }| +-20 | 0024715 | -3244949 | -2575 ‘0371281 -0399021 | —-0166540 
‘0361133 | —-0398102 | —-0155564 | +-25 | -0006354 | -3125381 | -259375 | (0386768 ‘0400021 | —-0183193 
‘0371281 | —-0399021 | —-0166540 | +-30 | -0 -2966934 | -26125 | -0402570 -0400550 | — -0200045 
-0381570 | —-0399739 | —-0177618 | +-35 | -0006788 | -2766112 | -263125 | -0418689 -0400595 | — -0217064 
“0392 — 0400250 | —-0188790 | +-40 | 0028223 | -2518296 | -265 -0435125 -0400141 | — -0234213 
-0402570 | —-0400550 | —-0200045 | +-45 | -0066301 | -2217400 | -266875 | -0451877 ‘0399172 | —-0251457 
-0413281 | —-0400635 | —-0211375 |] +-50 | -0123676 | -1855345 | -26875 | -0468945 ‘0397675 | —-0268757 
6424133 | —-0400500 | —-0222767 | +-55 | -0203937 | -1421251 | -270625 | -0486330 -0395633 | —-0286077 
-0435125 | —-0400141 | —-0234213 | +-60 | -0312032 | -0900151 | -2725 -0504031 -0393033 | —-0303374 
0446258 | —-0399553 | —-0245700 | +-65 | 0454992 | _-0270821 | -274375 | -0522049 ‘0389860 | —-0320608 
(0457531 | —-0398732 | —-0257219 | +-70 | -0643213 | 1-9501937 | -27625 | -0540383 -0386098 | —-0337735 
0468945 | —-0397675 | —-0268757 | +-75 | -0892920 | 1-8544683 | -278125 | -0559033 ‘0381733 | —-0354711 
°04805 —:0396375 | —-0280304 | +-SO | -1231416 | 1-7316990 | -28 -0578 -0376750 | —-0371490 
0492195 | —-0394829 | —-0291847 | +-85 | -1710041 | 1-5664684 | -281875 | -0597283 ‘0371134 | —-0888026 
0504031 | —-0393033 | —-0303374 | +-90 | -2444254 | 13241210 | -28375 | -0616883 ‘0364871 | —-0404269 
‘0516008 | —-0390982 | —-0314873 | +-95 | -3802830 | 1-8940059 | -285625 | -0636799 -0357945 | —-0420171 
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TABLE B. To assist the calculation of the Ordinates of the Correlation Frequency Curves 
from Expansion Formulae. 


“4, p—d: 
log (1 — p 22 = 1-8864189. log (1—p 2)2 = 1-:8125919. 
Us log x1 loz x2 pe hs pa log x4 log xs pi 
| —-95 | -6832371 | 2 699076 -0057781 | —-0229682 | +-0129113 -7367591 | 2-6107927 | -190625 
—-90 | -5320225 | 1-1 opens -0066125 | --0240578 | +-0125970 -5844606 | 1-0491282 | -19375 
| —:85 | -4433337 | 1:3868509 0075031 | —-0251404 | +-0121802 -4946527 | 1-2996689 | -196875 
— +8) | -3802830 | 1-5596756 ‘00845 — -0262125 ‘0116596 -4304462 | 1-4730716 | -2 
| —-75 | -3313147 | 1-6900043 -0094531 | —-0272705 ‘0110342 -3802830 | 1-6039976 | -203125 
—-70 | -2912852 | 17932591 -0105125 | —-0283109 ‘0103034 -3390180 | 1-7078702 | -20625 
~ 65 | -2574549 | 1:8776516 | -0116281 | —-0293303 -0094672 3039093 | 1-7929019 | -209375 
| —-60 | -2281921 | 1-9480680 ‘0128 | —-0303250 -0085260 -2733227 | 1-8639801 | -2125 
—-55 | -2024481 | -0076453 -0140281  —-0312916 -0074804 2462074 | 1:9242431 | -215625 
| —-60 | -1795110 | -0585101 ‘0153125 | —-0322266 -0063318 -2218487 | 1-9758187 | -21875 
—-45 | -1588770 | -1021639 -0166531 | —-0331264 -0050816 ‘1997401 | -0202098 | -221875 
—-40 | -1401787 -1396988 ‘01805 —:-0339875 ‘0037321 -1795110 | -0585101 | -225 
—-35 | -1231416 | -171927]1 | -0195031 | —-0348065 | +-0022856 | 1608837 | -0915336 | -228125 
| —-30 | -1075577 | -1994619 | -2 ‘0210125 | —-0355797 -0007451 -1436465 | -1198950 | -23125 
—-25 | -0932674 *2227694 | - -0225781 | —-0363037 ‘0008860 -1276363 | -1440625 | -234375 
| —:20 | -0801485 -2422038 | “0242 | —-0369750 -0026040 -1127264 -1643923 | -2375 
—-15 | 0681078 | -2580318 | -025878i | —-0375900 -0044047 -0988194 | -1811527 | -240625 
| —-10 | -0570761 -2704450 -0276125 | —-0381453 -0062833 ‘0858411 *1945400 | -24375 
| —-05 | -0470041 | -2795781 | -0294031 | —-0386373 -0082349 -0737368 | -2046893 | -246875 
| -00 | 0378604 *2855089 | - -03125 — 0390625 ‘0102539 -0624694 | -2116818 | -25 
+:05 | -0296300 | -2882652 -0331531 | —-0394174 ‘0123344 -0520175 | -2155489 | -253125 
| +:10 | -0223140 -2878260 | ‘0351125 | —-0396984. ‘0144700 0423754 -2162728 | :25625 
+:15 | -0159298 | -2841201 | -25 0371281 | —-0399021 0166540 -0335527 | -2137861 | -259375 
| +:29 | 0105126 | -2770218 | -26 “0392 | —-0400250 -0188790 -0255763 | -2079674 | -2625 
| +:25 | -0061172 *2663445 | 5 | -0413281  —-0400635 -0211375 ‘0184918 | -1986347 | -265625 
+30 | -0028223 | -2518296 | - -0435125 | —-0400141 -0234213 ‘0123676 | -1855345 | -26875 
++35 | -0007352 | -2331303 (0457531 | —-0398732 -0257219 -0072998 | -1683255 | -271875 
+-40 | -0 | -2097881 | - 04805 = | —-0396375 “0280304 -0034197 | -1465558 | -275 
+-45 | -0008089 | -1811980 | -2 | -0504031 | —-0393033 -0303374 -0009057 | -1196270 | -278125 
+°50 | -0034197 | -1465558 -0528125 | —-0388672 -0326331 0 -0867431 | -28125 
+:55 | 0081829 | -1047779 | ‘0552781 | —-0383256 -0349072 0010353 | _-0468291 | -284375 
+60 | -0155840 | _-0543720 | -2 ‘0578 —-03767 50 -0371490 -0044774 | 1-9984028 | -2875 
| +:65 | -0263161 | 1-9932210 | ‘0603781  —-0369119 -0393475 -0109971 | 1-9393579 | -290625 
| +:70 | -0414078 | 1-9181979 | - 0630125  —-0360328 0414911 -0215976 | 1-8665804 | -29375 
+°75 | 0624694 | 1-8244269 0657031 | —-0350342 | — 0435679 ‘0378604 | 1:7752089 | -296875 
+:80 | -0922180 | 1-7037082 06845 —-0339125 | —-0455654 0624694 | 1-6570690 | -3 
| +°85 | -1357728 | 5406314 : ‘0712531  —-0326643 ‘0474709 -1005057 | 1-4967424 | -303125 
+:90 | -2046635 | 1-3005493 | -2¢ | -0741125 | —-0312859 -0492710 -1634553 | 1-2596309 | 30625 
+-95 | -3357497 | 2:8728199 | -25 -0770281 | —-0297740 | —-0509521 -2881264 | 2:8351091 | 309375 


ore 
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TABLE B—(continued). 


p='d. pO: 
log (1 — p2)? = 1-8125919. log (1 — p2)? =1-7092700. 
co) ds Pa r logxi | logy, | gi | bz ps | pa 


| = | 
| 0025830 | -—-0178043 | +-0130733 | —-95 | -7983074 | 2-4939170 | -17875 | 0006570 | —-0129380  +-0113080 
2: 


0033008 | —-0191498 | +-0132346 | —-90 | -6450539 | 2-9327299 | -1825 | -0011 2 —:0148103  +-0120455 
0041064 | —-0205090 | +-0132540 | —-85 | -5542555 | 1-1837659 | 18625 | -0017258 | —-0159569 | +-0126231 
005 — 0218750 | +-0131250 | —-80 | -4890205  1-3576828 | -19 00245 —-0175375  +-0130246 
0059814 | —-0232410 | +-0128423 | —-75 | -4377890 | 1-4891430 | -19375 | -0033008 | —-0191498  +-0132346 
-0070508 | —-0246002 | +-0124015 | —-70 | 3954124 | 15935710 | -1975 | -0042781 | —-0207818 + 0132403 
-0082080 | — -0259457 | +-0117995 } —-65 | -3591488 | 1-6791805 | -20125 | 0053820 | —-0224218  +-0130306 
0094531 | —-0272705 | +-0110342 | —-60 | -3273589 | 1-7508604 | -205 | 0066125 | —-0240578 ‘0125970 


0107861 | —-0285679 | +-0101042 § —-55 | -2989895 | 1-8117504 | -20875 | -0079695 | —-0256780  +-0119330 


-0122070 | —-0298309 | +-0090097 | —-50 | -2733227 | 1-8639801 | -2125 | -0094531 | —-0272705 ‘0110342 
0137158 | —-0310528 | +-0077516 | —-45 | -2498484 | 1-9090541 | -21625 | -01109633 | —-0288235 | +-0098985 
0153125 | —-0322266 | +-0063318 | —-40 | -2281921 | 1-9480680 | -22 | 0128 —-0303250 | +-0085260 


‘0169971 | —-0333454 | +-0047535 | --35 | 2080718 | 1-9818379 | -22375 | -0146633 | —-0317633 | +-0069189 
0187695 | —-0844025 | +-0030208 | —-30 | -1892713 | -0109810 | -2275 | -0166531 | —-0331264 | +-0050816 


0206299 | —-0353909 | +-0011389 | —-25 | -1716222 | -G359679 | -23125 | -0187695 | —-0344025 | +-0030208 
0225781 | —-0363037 | —-0008860 | —-20 | -1549924 | -0571577 | -235 0210125 | —-0355797 | +-0007451 
0246143 | —-0371342 | —-0030466 | —-75 | -1392781 | -0748218 | -23875 | -0233820 | —-0366462 | —-0017344 


0267383 | —-0378754 | —-0053345 | —-10 | -1243983 | -0891598 | -2425 | -0258781 | —-0375900 | —-0044047 

0289502 | —-0385204 | —-0077405 | —-05 | -1102908 | -1003106 | -24625 | 0285008 | —-0383994 | —-0072503 

‘03125 — -0390625 | —-0102539 -00 | 0969100 | -1083599 | -25 | 03125 — -0390625 | —-0102539 

0336377 | —-0394947 | --0128634 05 | 0842253 | -1133434 | -25375 | -0341258 | —-0395674 | —-0133957 

0361133 | —-0398102 | —-0155564 -10 | 0722203 | -1152488 | -2575 | -0371281 | —-0399021 | —-0166540 
12 


| (0386768 | —-0400021 | —-0183193 -15 | -0608930 | -1140143 | -26125 | -0402570 | —-0400550 | — -0200045 
0413281 | —-0400635 | —-0211375 -20 | -0502571 | -1095254 | -265 0435125 | ---0400141 | —-0234213 
0440674 | —-0399876 | —-0239952 -25 | -0403433 | -1016073 -0468945 | —-0397675 | —-0268757 
| 0468945 | —-0397675 | —-0268757 30 | -0312032 | -O900151 | -2725 | -0504031 | —-0393033 | --0303374 
0498096 | —-0393963 | —-0297613 -39 | -0229135 | -0744170 | -27625 | -0540383 | —-0386098 | —-0337735 


bo 
aa 
or) 
fo"e) 
~] 
oO 


t++tt++t4t+ 


0528125 | —-0388672 | —-0326331 -40 | 0155840 | -0543720 | -28 -0578 —-0376750 | —-0371490 
05590383 | —-0381733 | —-0354711 | +-45 | 0093675 | _-0292945  -28375  -0616883 | —-0364871 | —-0404269 
0590820 | —-0373077 | —-0382544 | +-50 | 0044774 | 1-9984028 | -2875 | -0657031 | —-0350342 | —-0435679 
0623486 | —-0362637 | —-0409610 | +-55 | 0012127  1-9606388 | -29125  -0698445 | —-0333044 | —-0465305 
| 0657031 | —-0350342 | —-0435679 f +-60 | -0 | 1:9145399 | -295 | -0741125 | ~-0312859 | — -0492710 
0691455 | —-0336124 | —-0460509 | +-65 | -0614639 | 1-8580230 | -29875 | -0785070 | —-0289669 | —-0517436 
(0726758 | —-0319916 | —-0483849 70 | 0065529 | 1-7880012 | -3025 | -0830281 | —-0263353 | —-0539003 
0762939 | —-0301647 | —-0505437 | +-75 | -0167837 | 1-6996456 | -30625 | -0876758 | —-0233795 | —-0556908 
08 — 0281250 | —-0525000 f +-80 | -0347621 | 1:5848120 | -3 09245 —-0200875 | —-0570629 
0837939 | —-0258656 | — -0542255 | +-85 | -0654746  1-4281563 | -31375 | -0973508 | —-0164474 | —-0579619 
-0876758 | —-0233795 | —-0556908 | +-90 | -1202910 | 1-1951114 | -3175 | -1023781 | —-0120568 | — 0583312 
0916455 | —-0206600 | —-0568656 ff +-95 | -2358762  2-7751326 | 82125 | -1075320 | —-0080757 | --0581117 
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TABLE B. To assist the calculation of the Ordinates of the Correlation Frequency Curves 
from Expansion Formulae. 


| +:60 | 0065529 


p=". (oe oe 
log (1 — p?)? = 1-5613553. log (1 — p2)? = 1-3344538. 
| 1 } 
Lo | log xa log x2 fi pe ds | a r log x1 logx2 | $1 


| --95 | -8731268 


3332450 | -166875 | -0000002 | —-0087458 | +-0081651 | —-945 | -9728591 | 2-0942943 | -155 
—-90 | -7190257 | 2-7724818 | -17125 | -0000945 | —-0101880 | +-0094402 } - -90 | -8180004 | 2-5339099 | -16 
—*85 | -6273441 | 1-0239593 | -175625 | 0003611 | —-0117540 | +-0105881 | —-85 | -7255265 | 2-7857835 | -165 
— -80 | -5611883 "1983367 18 0008 ~-0134250 | +-0115710 | —-80 | 6585413 | 2-9605755 | -17 
| —°75 | 5089957 seed 184375 | 0014111 | —-0151821 | +-0123553 | --75 | -6054797 | 1-0929508 | -175 
—-70 | -4656161 4352073 | -18875 | 0021945 | —-0170065 | +-0129113 | —-70 | -5611883 | 1-1983367 | -18 
— 65 | -4283019 | 1-5213417 | -193125 | -0031502 | —-0188794 | +-0132135 | —-65 | 5229163 | 1-2849499 | -185 
— 60 | -3954124 “5935710 | -1975 0042781 | —-0207818 | +-0132403 | —-60 | -4890205 | 1-3576828 | -19 
— 55 | 3658926 | 1-6550366 | -201875 | 0055783 | —-0226951 | +:0129741 | —-55 | -458439] | 1-4196788 | -195 
—+50 | -3390180 | 1-7078702 | -20625 | -0070508 | —-0246002 | +-0124015 }| —-50 | -4304462 | 1-4730716 | -2 


‘7535784 | -210625 | 0086955 | —-0264785 | +-0115131 | —-45 | -4045223 | 1-5193703 | -205 
“7932591 | -215 0105125 | —-0283109 | +-0103034 | —-40 | -3802830 | 1-5596756 | -21 
one 219375 | -0125018 | —-0300788 | +-0087711 | --30 | 3574352 | 1-5948094 | -215 
576146 | -22375 | -0146633 | —-0317633 | +-0069189 | —-30 | -3357497 | 1-6253949 | -22 
88338 32 -228125 | -0169971 | —-0333454 | +-0047535 | —-25 | 3150444 | 1-6519100 | -225 
PANES | -2325 0195031 | —-0348065 | +-0022856 | —-20 | -2951711 | 1-6747215 | -23 
9239392 | -236875 | -0221814 | —-0361275 | —-0004699 | —-75 | -2760084 | 1-6941098 | -235 
9392061 | -24125 | -0250320 | —-0372898 | —-0034943 | —-10 | -2574549 | 1-7102846 | -24 
951 3444 | -245625 | -0280549 | — -0382744 | —-0067646 | —-05 | -2394256 | 1-7233964 | -245 


—-45 | -3142753 
-—-40 | -2912852 
—-35 | -2697607 
—-30 | -2494796 
—-26 | -2302671 
—-20 | -2119841 
-1945188 
—:10 | -1777811 
—-05 | -1616988 


| 
~~ 
a 


00 | -1462149 | 1-9604452 | -25 03125 — -0390625 | —-0102539 00 | 2218487 | 1-7335487 | -25 
+:05 | -1312858 | 1-9665509 | -254375 | -0346174 | —-0396353 | —-0139313 ]| +-05 | -2046635 | 17407774 | -255 

| +:10 | -1168803 | 1:9696565 | -25875 | -0381570 | —-0399739 | —-0177618 | +-20 | -1878190 | 1-7451026 | -26 
+-15 | 1029796 | 1-9697088 | -263125 | -0418689 | —-0400595 | --0217064 | +-15 | -1712730 | 1-7464775 | -265 


-7448109 | -27 
*7399556 | -275 
‘7316990 | -28 
‘7197481 | -285 
“7037082 | -29 
-6830497 | -295 
-6570600 | -30 
“6247660 | -305 
5848120 | 31 
5352510 | -315 
-4731726 | 32 
3939808 | -325 


+:20 | -0895777 
+:25 | -0766832 
+:30 | -0643213 

+°35 | -0525383 
| +:40 | -0414078 
+:45 | -0310401 
+:50 | -0215976 
+°55 | -0133179 


-20 | 1549924 
-25 | -1389531 
-30 | 1231416 
-35 | 1075577 
-40 | 0922180 
-0771634 
-50 | 0624694 
| 0482647 
60 | 0347621 
65 | 0223140 
70 | 0115163 


-9666028 | -2675 -0457531 | —-0398732 | —-0257219 
‘9601751 | -271875 | -0498096 | —-0393963 | —-0297613 
sone oe | 27625 | -0540383 | —-0386098 | --0337735 
9363424 | -280625 | -0584393 | —-0374949 | —-0377031 
9181979 | -285 -0630125 | —-0360328 | —-0414911 
8951960 | -289375 | -0677580 | —-0342047 | —-0450741 
8665804 | -29375 | -0726758 | —-0319916 | — -0483849 
°8313240 | -298125 | -0777658 | —-0293748 | — -0513521 
‘7880012 | -3025 0830281 | —-0263353 | —-0539C03 
*7345749 | -306875 | -0884627 | —-0228545 | ---0559500 
6680154 | -31125 | 0940695 | —-0189134 | —-0574175 
"583565 55 | 315625 | -0998486 | —-0144932 | —-0582164 -0034197 
4731726 | -32 -1058 — 0095750 | —-0582540 ‘0 

3216122 | -324375 | -1119236 | —-0041400 | —-0574351 | +-85 | -0053672 
0944747 | 32875 | -1182195 | +-0018306 | —-0556601 | +-90 | -0296200 
6814297 | -333125 | -1246877 | +-0983557 | —-0528253 | +-95 | -1075577 


pal 
Qn 


+65 | 0018354 

+:70 | -0 

+:75 | 0024195 | | 
| +80 | 0115163 
| ++85 | 0320384 | 
| +:90 | 0750398 

4:95 | -1767574 


-++++ +++ 4+4+44 
a) 


aes) 
Sx 
SQ 


Tees Scie Se le Se ek SO Se eee ae 


Nl lest PIS ITI DSI ISIS Se SS SSIS SIS SS II ining 
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TABLE B—(continued). 
Pa 8: p= “9, 
log (1 — p’)? = 1-3344538. log (1 —~ p2)! =2-9181304. 
pe ps pr r log x1 log x2 fi pe ps pa 
0006125 | —-0056047 | +-0044070 | —-95 | 1-1344648 | 3-6665553 | -143125 | -0024939 | —-0038913 | +-0010198 
-0002 — :0067750 | +-0060060 | —-90 | -9789250 | 2-1065114 | -14875 | -0014445 | —-0045010 | +-0025068 
‘0000125 | —-0081703 | +-0075898 } —-85 | 18857364 | 2-3587425 | 154375 | -0006799 | —-0054756 | +-0042099 
-00005 — -0097625 | +-0090871 | —-80 | -8180004 | 2-5339099 | -16 “0002 —-0067750 | +-0060060 
0003125 | —-0115234 | +-0104333 | —-75 | -7641490 | 2-6666800 | -165625 | -0000049 | —-0083591 | +-0077831 
-0008 —-0134250 | +-0115710 | —-70 | -7190257 | 2-7724818 | -17125 | -0000945 | —-0101880 | +-0094402 
0015125 | —-0154391 | +-0124493 | —-65 | -6798765 | 2-8595337 | -176875 | -0004689 | —-0122216 | +-0108873 
“00245 — 0175375 | +-0130246 } —-60 | -6450539 | 2-9327299 | -b825 °0011281 | —-0148103 | +-0120455 
0036125 | —-0196922 | +-0132598 | —-55 | -6134923 | 2-9952161 | -188125 | -0020721 | —-0167425 | +-0128469 
-005 — 0218750 | +-0131250 | —-50 | -5844606 | 1-0491282 | -19375 | -0033008 | —-0191498 | +-0132346 
-0066125 | —-0240578 | +-0125970 | —-45 | -5574342 | 1-0959782 | -199375 | 0048143 | —-0216015 | +-0131629 
00845 — 0262125 | +-0116596 | —-40 | -5320225 | 1-1368698 | -205 -0066125 | —-0240578 | +-0125970 
0105125 | —-0283109 | +-0103034 | —-35 | -5079254 | 1-1726281 | -210625 | -0086955 | —-0264785 | +-0115131 
‘0128 — :0303250 | +-0085260 | —-30 | -4849062 | 1-2038806 | -21625 | -0110633 | —-0288235 | +-0098985 
0153125 | —-0322266 | +-0063318 | —-25 | -4627737 | 1-2311092 | -221875 | -0137158 | —-0310528 | +-0077516 
01805 — -0339875 | +-0037321 | —-20 | -4413696 | 1-2546862 | -2275 ‘0166531 | —-0331264 | +-0050816 
0210125 | —-0355797 | +-0007451 | —-15 | -4205607 | 1-2748976 | -233125 | -0198752 | —-0350042 | +-0019092 
-0242 —:0369750 | —-0026040 | —-70 | -4002321 | 1-2919599 | -23875 | -0233820 | — -0366462 | —-0017344 
0276125 | —-0381453 | —-0062833 | —-05 | -3802830 | 1-3060315 | -244375 | -0271736 | —-0380123 | —-0058066 
03125 — 0390625 | —-0102539 00 | -3606232 | 1-3172208 | -25 -03125 — 0390625 | —-0102539 
0351125 | —-0396984 | —-0144700 | +-05 | -3411701 | 1-3255880 | -255625 | -0356111 | —-0397568 | —-0150118 
0392 —-0400250 | —-0188790 } +-70 | -3218470 | 13311524 | -26125 | -0402570 | —-0400550 | —-0200045 | 
0435125 | —-0400141 | —-0234213 | +-75 | -3025809 | 1-3338874 | -266875 | -0451877 | —-0399172 | —-0251457 | 
04805 — -0396375 | —-0280304 f +-20 | -2833014 | 1-3337208 | -2725 -0504031 | —-0393033 | —-0303374. 
0528125 | —-0388672 | —-0326331 | +-25 | -2639393 | 1-3305264 | -278125 | -0559033 | —-0381733 | —-0354711 
-0578 —-0376750 | —-0371490 | +-30 | -2444254 | 13241210 | -28375 | -0616883 | —-0364871 | —-0404269 
0630125 | —-0360328 | —-0414911 | +-35 | -2246902 | 1-3142457 | -289375 | -0677580 | —-0342047 | —-0450741 
06845 — 0339125 | —-0455654 | +-40 | -2046635 | 1-3005493 | -295 -0741125 | —-0312859 | —-0492710 
0741125 | —-0312859 | —-0492710 45 | -1842748 | 1-2825579 | -300625 | -0807518 | —-0276909 | — -0528645 
-08 —-0281250 | —-0525000 | +-450 | -1634553 | 1-2596309 | 30625 | -0876758 | —-0233795 | —-0556908 | 
-0861125 | —-0244016 | —-0551378 | +-55 | -1421425 1: -2308910 | -311875 | -0948846 | —-0183117 | —-0575750 
09245 — -0200875 | —-0570629 | +-60 | -1202910 | 1-1951114 | -3175 -1023781 | —-0120568 | —-0583312 
0990125 | —-0151547 | —-0581467 | +-65 | -0978953 | 1-1505243 | -323125 | -1101564 | —-0057467 | — -0577622 
-1058 — :0095750 | —-0582540 | +-70 | -0750398 | 1:0944747 | -32875 | -1182195 | +-0018306  —-0556601 
1128125 | —-0033203 | —-0572424 } +-75 | -0520175 | 1-0227457 | -334375 | -1265674 | +-0103245 | —-0518057 
-12005 + 0036375 | —-0549629 | +-S0 | -0296300 | 2-9280951 | -34 +1352 | +:0197750 | —-0459690 
1275125 | +-0113266 | —-0512594 | +-85 | -0100596 | 2-7965809 | -345625 | -1441174 | +-0302222 | —-0379088 
+1352 +:0197750 | —-0459690 | +-90 | -0 2-5959739 | 35125 | -1533195 | +-0417061 | —-0273728 
1431125 | +-0290109 | —-0389219 | +-95 | -0274889 | 2-2200433 | -356875 | -1628064 | +-0542668 | —-0140979 
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TABLE C._ Position of Origin and Abscissal Unit in terms of Standard Deviation. 
The first Number gives the Position of the Origin, the second the Abscissal Unit. 


n p=0 jo oll p="2 p=o p=<4 
A 0 1116225 -2266490 -3490329 -4840501 
4 beans -0709720 0717802 0732142 0754314 
4 ! 0 -1478819 -3011028 | 4659475 6510044 
0866025 0870223 0883242 0906481 0942760 
y ( 0 1778638 -3643600 5634956 ‘7915154 
2 (- 1000000 1005663 1023258 1054792 1104330 
6 \ 0 { -2038711 | 4165411 { -6485234 ( -9145610 
1-1118034. ? -1125038 1146829 1 -1185985 | -1247759 
7 0 -2271065 | -4645401 7246924 10250662 
Ropers | Bape 1 -1258629 -1304802 -1377861 
9 ( 0 ee ea | -7942029 Re eee 
“ | (-1322876 1332252 1361468 i -1414136 1497650 
ake) 0 Re ere j ganas (Lee 
; (-1414214 142465] 1457192 1515911 1609168 
10 ( 0 | 9861133 | -5865316 hele ee 
1-1500000 1511433 ( -1547090 1611481 1713870 
11 0 { -3033102 |e Geess \ -9750815 eee 
1581139 1 -1593510 1632105 1 -1701841 1812835 
12 \ 0 ( -B196014 \ -6557863 es 0286451 ve -4670090 
i (-1658312 { -1671572 ? -1712949 1787749 1906890 
2B \ 0 oe ees ope teres 
: 1-1732051 1746157 -1790180 1869796 1996684 
1 \ 0 jeeteaes ( -7185489 jee jee 
1-1802776 -1817701 1 -1864240 1948454 2082735 
15 \ 0) ee lane et eae 
: 1-1870829 1886516 1935489 2024107 2165466 
16 | }roseso2 | | igspo9: | | sossan | | ooo7070 | | 2easees 

Q D492 *LY¥oZYZZ OC . 4222 / 

: 0 3911338 8037041 1:2637617 1-8088795 
i eacccop -2017147 2070688 2167611 2322314 
Lt) Bie \ 0 -4039379 8301762 ( 1-3052186 1-870009¢ 
18 | }.9061553 2079391 2135096 | ] -2234913 2396972 
F 0 -4163514 8558391 1-3465992 1-9292543 
ue See -2139828 2197626 -2302287 } 2469416 
20 0 fea \ 8807623 fa gepio2) eee. 
2179449 2198606 1) -2258433 2366781 2539828 
21 0 \ -4401360 9050057 \ 1-4247026 2-0427101 
a -2236068 | -2255854 2317651 ] -2429578 -2608369 
99 \ 0 ( -4515616 ( -9286218 ee 2 
ae /-2291288 { -2311687 { -2375398 2490805 2675179 
93 0 -4627064 i. 9516565 \1-4987939 i 1503024. 
2345208 -2366202 2431779 | -2550572 2740380 
4 0 4735903 -9741508 (1:5345139 ( 2-2021616 
- 2397916 2419491 2486887 1 -2608980  -2804084 
5 0 4842308 9961404 ( 1-5694295 ere 
7 2449490 2471633 2540801 | -2666117 2866387 
50 0 6995857 jee | 2-2751863 jziotee 
-3500000 3533454 3637983 | -3827476 4130572 
100 \ 0 9997548 leet 3-2571060 jce7ete 
a 1-4974937 5023813 5176589 5453562 5896685 
400 0 ie ae {4 1553109 \6-5735909 ae 
9987492 1-0087684 1:0400784. 11-0968491 1-1876817 
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TABLE C—(continued). 

n p= p=6 p="7 p=8 p=9 
: 6397035 -8298069 | 4 1-0821815 1-4670408 2-2572614 
-0787233 -0836496 | i -0914099 -1051565 -1375647 
j -8697069 11461037 | 4 1-5308211 | | 2-1592865 3-6084918 
0997391 -1080827 | | -1216223 | | -1467433 -2112748 
F 1-0657049 1-4204694. 1-9303003 2-8019830 4-9700548 
1179621 1296152 -1488972 -1857778 -2860585 
F 1-2384801 1-6646016 2-2906234 3-3941363 6-2758785 
-1342232 -1489755 -1737018 -2219447 -3579595 
P 1-3941989 1:8857159 | ( 2-6192262 3-9396570 7-4992321 
ik 1490074 1666372 ( -1964454. 2553750 4253943 
; 1-5367713 2-0886726 2-9218412 | 4-4441908 8-6363301 
‘ -1626318 -1829343 ‘2174712 | -2863637 4881142 
4 \ 1-6688524 2-2769195 3-2029089 4-9133944 9-6932562 
1 -1753174 -1981116 -2370570 “3152309 5464383 
i 1-7923354 9-4529927 | | 3-4658872 2 5:3522889 mere’ 
-1872248 2123532 | | -2554247 3.422692 -6008602 
WA 1-9086231 \2 2-6188112 37134918 | me -7651162 eos 
-1984751 2258005 2727512 | 3677294 -6518808 
6 2-0191978 me 7158616 3-9478808 6-1553948 ee 
-2092046 -2385646 -2891787 3918213 -6999496 
-1236943 2-9253194 4-1707880 6-5260261 13-2967143 
le We -2193627 -2507341 “3048225 -4147185 "7454499 
a 2-2938084 3-0681290 ! 4-3836241 | | 6-8794080 ty 
-2291249 -2623809 3197770. | | -4365653 -7887049 
te 2-3200595 3-2050605 45875494. | | 7-2175324 nae 
2385095 -2735636 3841195 | | -4574817 8299826 
ae 2-4125823 3-3367521 | | 47835297 | | 7-5420659 | Deeb 
-2475526 2843315 | | -3479156 | | -4775684. -8695312 
ap 2-5018385 | | 3-4637416 4-9723756 7-8544126 eS 
-2562879 | | -2947261 -3612200 -4969104 -9074724 
ae 2 5881420 3:5864801 | | 5-1547751 8-1557645 16-8882836 
-2647443 -3047823 | lt -3740794 5155798 -9440329 
AG 2-6717601 3-7053440 | | 53313166 | { 84471433 eae 
2729462 3145295 | | -3865338 | | -5336387 ‘9793257 
a5 2-7529222 | | 3-8206677 5-5025071 | ( 8-7294265 oe 
< -2809148 | ( -3239936 3986177 | | -5511403 1-0134685 
4 2-8318307 3-9341863 | | 5:6687868 | i 9-0033775 ie “TALLT17 
@ -2886686 -3333199 | | -4103611 5681311 1-0465598 
5 2-9086578 4-0418750 | | 5:8305409 | ( 9-2696624 ne -3216440 
fe -2962240 -3421658 | | -4217904 | 7 -5846516 1-0786891 
oe 2-9835558 4-1481734 5-9881082 | 95288659 eee 
-3035949 “3509062 -4329287 | -6007373 1:1099306 
Ou 3-0566623 4-2518695 61417876 | 9-7815042 Eee ee) 
“3107942 3594371 4437967 | | -6164198 1-1403534 
Re 3-1280980 4-3532587 6-2918434 ue 0281639 eo -9711894 
¢ “3178324 -3677825 4544124 -6317351 1-1700149 
PF 4-5677713 6-3911451 9-2990780 | {14-9462808 oe 5884211 
-4603212 5361355 6677547 | | -9876711 17584334. 
oa 6-5628599 9-2058648 ee | 21-6797456 ee 0257637 
-6587893 7696570 -9623213 | | 1-3574820 2-5594824 
13-2812041 18-6652227 27-3067557 44-1732879 94-0676645 
400 1-3293798 a 1-5566854 1-9517326 LS! 2-7620803 5:2272309 


MISCELLANEA. 


I. Tables for estimating the Probability that the Mean of a unique 
Sample of Observations lies between — oo and any given Distance of 
the Mean of the Population from which the Sample is drawn. 


By “STUDENT.” 


In the last number of Biometrika Mr Young completes the table given in Vol. x. p. 522 of 
the standard deviation frequency curves for small samples by working out the cases where the 
numbers in the sample are as low as two and three. 


In the course of his note he writes “The smallest sample considered is that of n = 4 but 
samples of two and three are of occasional occurrence, especially in physical work, and now and 
again a value of the probable error of an experimental result is deduced from a set of two or of 
three observations.” 


Further on he states “it is evident that the probable error determined from a set of three 
observations is very untrustworthy and that when there are only two observations it is very 
much worse.” 


Now in my original paper (Biometrika, Vol. v1. p. 1) I stopped at n = 4 because I had not 
realised that anyone would be foolish enough to work with probable errors deduced from a smaller 
number of observations, but now I too will complete my tables which will I think emphasise the 
moderation of the second quotation from Mr Young’s note. 


Generally speaking there are two objects in determining the standard deviation of a set of 
observations, namely (1) to compare it with the standard deviation of similar sets of observations, 
and (2) to estimate the accuracy with which the mean of the observations represents the mean of 
the population from which the sample is drawn. 


The former purpose is served by the table which Mr Young was engaged in completing, the 
latter, which is by far the most common use of the s.p., by the table which I gave in my original 
paper and which I now propose to complete downwards by including n = 2 and n = 3 and to 
extend upwards as far as n = 30. 


In the tables the probability is given (to four places of decimals) that the mean of a unique 
sample shall lie between — oo and a distance z from the mean of the population, z being measured 
in terms of the s.D. (s) of the sample. 


[By unique I mean to say that all the information which we have (or at all events intend to 
use) about the distribution of the population is given by the sample in question. ] 


To compare with the last column of the table (n = 30) I have given the corresponding prob- 


: s S. 

ability calculated from the nearest normal curve, namely the one with s.D. ——— (not ~==—— 

; Vn —3 Vn — 

as is usually given) and this shows I think that for ordinary purposes Sheppard’s tables may be 
used with n >30. 
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With regard to samples of two it will be seen that odds of 9:1 are reached at a little more 
than three times the s.p., of 99: 1 at a little more than thirty times, of 999: 1 at a little more 
than 300 times, while 9999: 1 is reached at in or about 3000 times the s.D.! 


Perhaps I may be permitted to restate my opinion as to the best way of judging the accuracy 
of physical or chemical determinations. 


After considerable experience I have not encountered any determination which is not 
influenced by the date on which it is made, from this it follows that a number of determinations 
of the same thing made on the same day are likely to lie more closely together than if the repetitions 
had been made on different days. 


It also follows that if the probable error is calculated from a number of observations made 
close together in point of time much of the secular error will be left out and for general use the 
probable error will be too small. 


Where then the materials are sufficiently stable it is well to run a number of determinations 
on the same material through any series of routine determinations which have to be made, 
spreading them over the whole period. 


Thus if an analyst is determining the percentage of nitrogen in different samples of seed 
corn and wishes to know the probable error of the determination, i.e. how accurately his 
figures give the percentage of nitrogen in a bulk of corn. 


Let us suppose that he makes ten determinations a day for sixty days and that it is of some 
real importance to him to get a clear idea of his error; he will do well to get sixty different samples 
from the same bulk of corn and analyse one of these on each of the sixty days; unless I am much 
mistaken he will have a more modest idea of his infallibility than he had before he compared 
the sixty results together. He will also, in so far as his repeated sample is representative, get 
a close approximation to the probable error of a single determination. 


In some cases it is not possible to obtain a sufficient bulk of material and then it may be better 
to determine each result in duplicate, the repetitions being separated as widely as possible in 
point of time. Then the square root of the mean of the squares of the differences between 
corresponding pairs gives twice the standard deviation of the average of a pair, and if enough 
pairs can be taken and the determinations made on different samples this is a better method 
than the other, as the error of the sampling is better sampled. 


In the preparation of the tables a slight mistake was discovered in the second row of the odd 
numbers in the original table by Mr W. L. Bowie to whom I am indebted for the calculation of 
the new figures. 
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II. On the Representation of Statistical Data. 
By L. ISSERLIS, D.Sc. 


§ 1. Ina paper entitled “On the Mathematical Representation of Statistical Data*” Professor 
Edgeworth gives several examples of the fitting of what he calls the generalized curve of error 
and some of his recent modifications of it to frequency data. The examples to which he applies 
bis methods are taken, with a single exception, from a paper by the present writer, in which 
double hypergeometrical series were fitted to frequency distributions in two variablest. Professor 
Edgeworth describes these data as “‘well swited to test and illustrate methods of representing frequency 
distributions.” Judged by the usual criterion for the testing of goodness of fit these examples 
indicate that Professor Edgeworth’s curves are decidedly unsatisfactory, but he maintains that 
the fault lies in the criterion and not in the curves. Professor Edgeworth proposes therefore 
a new criterion for the testing of the goodness of fit of theory to observation, viz. the magnitude 
of the sum S (e?/U), where U is the ratio of the frequency in any category of the data to the 
total frequency (called by Professor Edgeworth the relative frequency) and e is the difference 
between this and the theoretical relative frequency. He adds erroneously that this is Professor 
Pearson’s x? divided by the total frequency N. To quote from Professor Edgeworth’s paper 
(p. 461) ‘“‘His (i.e. Professor Pearson’s) y* being, in our notation, equal to S (N?e?)/S (NU), where 
N is the total number of observations, is V times as great as our criterion S (e?/U).” As a matter 
of fact Professor Pearson’s x? is equal to S {(m’ — m)?/m’} where m’ is the theoretical and m the 
observed frequency, and in Professor Edgeworth’s notation this would be NS (e?/w) where w is 
the theoretical relative frequency. The fact that the denominator U employed by Professor 
Edgeworth is the observed relative frequency must make the test nugatory in practice, for S (e?/U) 
will be infinite whenever the data contain one category with zero frequency. Very little difference 
results in Professor Edgeworth’s examples when the U’s are replaced by w’s in the criterion 
proposed by him, but the later part of his paper claims to give methods of curve fitting whicht 
‘not only do or may give better values for the coefficients than the use of moments, but also 
must§ give better results than moments or any other process.” Professor Edgeworth attempts in 
the section entitled ‘‘some new constructions” to achieve this highly desirable end by, as he says, 
minimising the Pearsonian criterion x”. Actually he minimises S (e?/U), a very much easier 
thing to do since the denominators are constant, but leading to equations which are altogether 
irrelevant. 

Professor Edgeworth’s proposed criterion is an absolute one, independent of the number of 
categories. The value S (e?/U) is usually small and apparently we are to judge of the goodness 
of fit by the number of zeroes between the decimal point and the first significant figure. 


In what follows we fit Pearsonian frequency curves of the appropriate type to the examples 
used by Professor Edgeworth. In addition to calculating y? and the corresponding value of P 
for each, we give also the values of S (e?/U) and S (e?/u). All these are compared with the 
corresponding values for the curves fitted by Professor Edgeworth||. It will appear that Professor 
Edgeworth’s curves are a poorer fit than the Pearsonian types not only when judged by the 
legitimate criterion but even according to the criterion proposed by Professor Edgeworth, and 
this in spite of the fact that he does not as a rule attempt to fit his curve to the whole of each 
distribution but confines himself to the central compartments. 


* Journal of the Royal Statistical Society, Vol. LXx1x. Part tv. (July, 1916). 

+ Phil. Mag. September, 1914. t Loc. cit. p. 476. 

§ Professor Edgeworth’s italics. 

|| So far as that was possible. Several of Professor Edgeworth’s calculations in the later sections 
of the paper are not finished, and some of the arithmetic would have to be rectified before completing 
the calculations. 


Miscellanea 419 


§2. Examete I. Distribution of 25,000 Deals at Whist according to the 
Number of Trumps held by the First Hand. 


TABLE I. 
Number of tramps | 0 | 1 2 3 4 5 6 > | 8. | cota | 
Frequency... | 215 / 1724 | 5262] 7440 | 371 | 2950 852 1662025000 
| 


| 
Relative frequency | -00860 | 06896 | 21048 | -29760* | -25484 | -11800 | -03408 | 00664 | -00080 | 1-00000 | 


We quote from the Phil. Mag. paper previously referred to the following values: Mean at 
324944 trumps, moments about the mean py. = 1-66130, ns = -39094, also 8, = -03333, 8, = 2-93469. 
These values lead to the Type I curve 


- 96 005 1 v 16.21927 1 x 31-75022 
Ueerai ole: 5 ( + ape) ( a 17608) 


with origin at the mode (= 3-12682 trumps) represented by the continuous line in Fig. 1. 


Frequency 


-4 3 2 ml (0) 1 2 3 4 5 6 7 8 9 10 i“ 12 13 14 15 lo 
Trumps held by First Hand 


Fie. 1. ©@©© _ Distribution of 25,000 deals at Whist according to the number of Trumps held 
by the First Hand according to theory. 


Continuous Line: Type I Frequency Curve 
7596 005 1 xr 16-21927 1 x 31. 
aE ( + eam) ( -ia700es) 


* -29776 in Prof. Edgeworth’s paper is clearly a misprint. 
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The following table gives the theoretical frequencies of this curve compared with the observed 
frequencies and the theoretical frequencies for the two curves fitted by Professor Edgeworth. 
These last being obtained by multiplying the values he gives of the theoretical relative frequencies 
by 25,000. 


TABLE II. 
] ] | 
_ Number oftrumps| -J | 90 1 2 3 4 5 6 7 & 9 | 10 | Totals} 
Observed frequency — | 215 1724 5262 7440 6371 2950 852 166 20 | — | — |} 25000 
Type I . | 4-91 | 202-85 | 1698-32 | 5201-23 | 7643-49 | 6218-30 | 2993-16 | 871-65 | 152-89 | 15-34 | 85 | -02 | 25003 
ee | ~—————_ 
Sum = 207-76 | Sum = 16-21 
Prof. Edgeworth’s : 
Ist curve, p. 461 = —- | — 5120 | 7338 6128 3028 — — — — | — — 
| Prof. Edgeworth’s | | 
2nd curve, p. 467 | — 258 1778 5093 | 7420 6183 3098 965 208 — |— | — } 25003 


For the Type I curve x? = 13-60 while n’ = 9 so that P = -0938. 

Professor Edgeworth’s first curve gives x? = 17-48, n’ = 4 and P = -000574*. His second 
curve gives x? = 41-55 for n’ = 8 so that P is less than -000001. If in the second curve we take 
the four central compartments only (i.e. 2 to 5 trumps) we have y? = 19-48, n’ = 4 so that 
P = -000222. 

Judged by the usual criterion then, both of Professor Edgeworth’s curves are a very much 
poorer fit than Pearson’s Type I. If we calculate the value of the criterion proposed by Professor 
Edgeworth we find that 

S (e2/U) is 000814 for Professor Edgeworth’s first curve (four compartments), 
00181 for eight compartments of his second curve, 
and -00077 for the four central compartments of the second curve. 
These values of S (e2/U) may be compared with -000544 for nine compartments or -000420 for 
four compartments for Pearson’s Type I. 

Professor Edgeworth says (p. 467) of this example that “the correspondence between fact 

and theory is very satisfactory.” 


Examp.e IT. The theoretical Frequency Distribution of black Balls in 210 Draws 
of four Balls out of a Bag containing four white and six black Balls. 


The frequencies are 


TABLE III. 


9 Bele 7 aloe 


0 1 | 


| 
1 24 | 90 80 15 210 


The mean is at 2-4 black balls, the moments about the mean are py = -64, wz = — -032, wy = 14 
so that 8, = :00390625, 8, = 2-747768. The corresponding Pearsonian frequency curve is of 
Type II, 


101 06905 1 ik 9.3938124 
y = 101060051 - ergrrmmap] 


represented by the continuous Jine in Fig. 2. The origin is at mode = mean = 2-4, 


* Professor Edgeworth calls this x?=40, n’=6 but only states the calculated (relative) frequencies 
for four compartments. 
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110 


Frequency 


Number of Balls drawn 


Fic. 2. ©@©© Theoretical Distribution of Black Balls in 210 draws of 4 balls out of a bag 
containing 4 white and 6 black balls, i.e. terms of the hypergeometrical series F(—4, —6, 1, 1). 


Continuous Line: Type II Frequency Curve 


; a“ 277 9-393812 
y = 10106905 [2 : (qa | 


The table below gives the theoretical frequencies deduced from this equation and the values 


given by Professor Edgeworth on p. 471 of his paper*. 


| 


TABLE IV. 
| | 
Number of Black Balls drawn 0 Loe | Zz 3 4 | Totals 
Beton —. <r a = S| | 
Data Hea Sn ex ae 1 24 90 80 | 15 210 
SSS 
Sum = 25 
Type IL... ee as bee ‘677 | 24-322 | 90-669 79-051 | 15-036 209-755 
SS | 
Sum = 24-999 
Professor Edgeworth’s curve, eee ae 
p. 471 a Aa 28-39 | 86°71 76:56 28-44 | 220-10 
| | | 


* Professor Edgeworth’s are relative frequencies. These have been multiplied by 210 to obtain 


the entries in the table. 
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The value of y? for the Type II curve is only -016 while »’ = 4. This value is so small as to 
be outside Elderton’s table, so we put 


aes a 
= ee | e 2X dy + Ve eX’ y, 
wi} x 7 


and from Sheppard’s table find the first term to be -89894 while the second is -10052 so that 
P = -99946. The fit is practically perfect. This is not surprising since we have fitted a curve 
which is known to be the so-called “parallel” to the hypergeometrical series to a hypergeometrical 
distribution. The value of y? for Professor Edgeworth’s curve is 7:035 which with n’ = 4 gives 
P=-071. The sum S (e?/U) is 000077 for the Type II curve and -061* for Professor Edgeworth’s 
curve. He says (p. 470) that “the calculated frequencies correspond fairly well with the observed 
frequencies.” The divergence between the total for the data (210) and for the calculated fre- 
quencies (220-1) is extraordinarily great and suggests that some errors may have crept into 
Professor Edgeworth’s arithmetic. The values of S (e?/u) are -000077 for the Type II curve and 
-0335 for Professor Edgeworth’s curve. 


Examee III. Distribution according to ages, of the Marriages of 
235,252 Spinsters. 


This exampie is described by Professor Edgeworth as one of ‘“‘considerable abnormality,” and 
he states (p. 462) “I should be considerably surprised if the Pearsonian types would stand the 
Pearsonian criterion in the case of Dr Isserlis’ second example which consists of the distribution 
of 235,252 ages at marriage.” 

The answer is that at any rate they stand the test much better than the curve fitted to this 
example by Professor Edgeworth. That the fit is a poor one is probably due not so much to 
the size of the total frequency (as Professor Edgeworth suggests) but to heterogeneity of the 
material t, due to misstatement of age. 


The observed frequencies are as follows: 


TABLE V. 


Np Bee | 2o— 45— | 50— | 55— | 60— (650 eliotn 
£ 


[Ss 
na 


Age 7) L 20 


| 
I a | 
Frequency 17546 | ri 1854 704 11 | 20241 5873 1706 | 636 | 171 64 | 28 23 Jl | 235252 


Again quoting from the Phil. ie (loc. cit. p. 385): with arbitrary origin at 22-5 years and 
unit 5 years, the mean is at -5227798, the moments are py, = -8294195, pz = 1-2482896, 


pa = 59016221 whence 8, = 2-730913, 8, = 8-578766. These values of 8,, 8, indicate a curve 
of Type VI, but the criterion 


K = By (By + 3)?/4 (485 — 3)3)) (28, — 33, — 6) = 1-151975 


is so nearly unity, that (having regard to the large values of 8,, 8.) we may expect the transition 
curve of Type V to give nearly as good a fit. 


The curve of Type VI is 
y = (4:29787) 1014 (a ee 2-305660)6-599764 ge 18-410607 
with origin at 4-655444 years, mode at 22-625624 years, mean at 25-1138899 years and a measured 


in 5 year units. 


* Not -07 (Prof. Edgeworth’s value on p. 470). 

+ On this point cf. W. Palin Elderton, Frequency Curves and Correlation, pp. 142-3. 

t The Phil. Mag. value, 8,=8-77876 is wrong (though py, “3, #4 are correct). Professor Edgeworth 
has used the wrong value, but the error is not enough seriously to influence his figures. 


eer! 


The curve of Type V is 


Miscellanea 


y= (1-43044) 108 go 10-7294715 Cia 


221029363 


x 
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Here also 2 is in 5 year units, the origin is at 12-453948 years and the mode at 22-754053 years. 


The two curves are shown in Fig. 3. 
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y= (1:48044) LOM giv-729172 © 
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35 40 a5 so 
Age of Spinster at Marriage 


Distribution of Ages at Marriage of 235,252 Spinsters. 


22°1029363 
v 


55 60 


Actual Frequencies represented 


= 2-305660)*-599764 


gpl8-410607 


The observed and calculated frequencies are given in the table below. 


* Te. Professor Edgeworth’s relative theoretical frequencies multiplied by 235252. 


TABLE VI. 
Age at Marriage of Spinsters. 
Observed . Prof. Edgeworth’s* | 
ace frequency Type VI Type V curve, p. 469 
15— 17546 159694 | 16448+ 14633 
20— 118542 120458 | 120061] 123696 
25— 70411 69289 69508 68811 
30— 20241 21009 20907 19596 
39 — 5873 5896 5770 5740 
40— 1706 1732 1692 
45— 636 3 582 | St | 544 | | 
i= 171| S 191 |< 191 | 8 
55— 64 - || ve a Tee ill | » 2752 
60— 28| g 28 | & 30) a8 | 
65— 23| 2 2 |Z 13 | Ea 
10— 11 16§ 13§ 
ee 2 BEE tattz | ae 
Totals 235252 235253 235250 235228 


ft 16-184 years to 20 years. 


{ 12-454 years to 20 years. 


§ All beyond 70 years. 
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It is worth noting that with the exception of one age group (80—35) the theoretical frequencies 
corresponding to Professor Edgeworth’s curve are consistently worse than both the Type V and 
Type VI frequencies. 


The values of y?, S (e?/U), and S (e?/w) for the three curves are given by the following table: 


TABLE VII. 
| | 
| oe | 8/0) S (eu) 
| ie - seal | | 
| ype: V. oc eanlagee see 129 -00053 | 00055 
Type VI oe ae es eN eo 00090-00099 
Prof. Edgeworth’s curve — 868 00366 | 00369 | 


Neither Type V nor Type VI fits the data well, but all the criteria agree in making Professor 
Kdgeworth’s curve a much poorer fit than either of the others. It should be observed that in 
order to make comparison with Professor Edgeworth’s curve possible we have followed him in 
grouping all ages above 40 together. This gives a fictitious improvement to the Type V curve, 
for Type VI which depends on the first four moments will fit the “tail” of the experience better 
than Type V whose constants depend on the first three moments only. The value of py is of 
course heavily influenced by the marriages at the higher ages. 


The source of heterogencity in the spinster curve is misstatement (false or in ignorance) of 
age. J am indebted to Professor Pearson for the remark that Table V shows a lumping up of 
frequency before the decades. If we compare Type VI against the observed, we find a redundancy 
of women just below 30, i.e. at 25, and this is repeated at 45. Again the group 15—20 is increased 
by the minors who misstated their age as over 15. Professor Pearson informs me that these 
points have come up for consideration in the treatment of very ample statistics not only English 
but Italian, and in both cases the males give better results than the females. 


We may attempt to circumvent the women who object to entering a new decade by grouping 
in ten year periods round the decades. The following table shows that the removal of this source 
of heterogeneity at once renders Type VI a very good fit, the transition curve Type V a fair fit, 
but leaves Professor Edgeworth’s curve a very poor fit. 


TABLE VIII. 
as ee a | 

Observed | : : | Prof. Edgeworth’s 
| Age frequency | Type V1 Types | curve 

15— | 136088 | 136427 | 136509 138329 
,  25— 90652 | 90298 90415 88407 

3d— 7579 | 7628 1 7462 
| 45— 807 773 | 735 8492 
| 55— 92\ Sum | 99\ Sum 103) Sum 
| 65— | 34) =126 28 f =127 26/ =129 
| Totals 235252 235253 235250 | 235228 


We now have x? = 4:56 when n’ = 5 for the Type VI curve so that P = -34 indicating quite 
a good fit. For Type V the value of x? is 11-01 with n’ = 5 giving P = -027, while for Professor 
Edgeworth’s curve x? is 96-31 with n’ = 3 so that P is still negligible. 
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§ 3. Professor Edgeworth has on several occasions advocated the value of his generalized 
frequency curve, and endeavoured to illustrate its numerical advantages. On the present 
occasion he has introduced what appears to be an erroneous formula for testing its ‘“‘ goodness 
of fit.” In his paper he gives no theoretical discussion of the test. It would be well if Professor 
Edgeworth tried on some future occasion to give a demonstration of its validity. Any such attempt 
would have to account for the departure from Bernoulli’s law that accuracy varies as the square 
root of the number of observations. According to Professor Edgeworth’s test results obtained 
from 100 observations and from 1,000,000 would be equally accurate provided the relative 
frequencies were the same*! However taking him on his own ground we find his curves give 
a very poor fit compared with Pearson’s well-known types, and offer no easy systematical methods 
of treating frequency. He escapes what is the real hard work of curve fitting by adopting the 
higher moments calculated by the biometricians and then appears to say how very much shorter 
his methods are! We are prepared on any occasion to race him in fitting a frequency curve 
ab initio and getting a better fit as a result. 


Added April 12th, 1917. Professor Edgeworth in veply to a letter in which I suggested that 
there might be an error in his numerical values quoted in my Table IV, has replied that owing 
to a clerical error the frequency in the last column is wrong. On using his amended relative 
frequency, that entry becomes 18-35 instead of 28:44. With this alteration y? becomes 1-297 
instead of 7-035, so that the corresponding value of P is -733. This of course denotes a good 
fit, but one which is still very inferior to the Pearsonian Type II curve for which P=-999. 


III, Relation of the Mode, Median and Mean in Frequency Curves. 


By ARTHUR T. DOODSON, M.Sc. 


$1. It is well known that in frequency curves of a moderate degree of asymmetry the distance 
from the mode to the median is approximately two-thirds of the distance from the mode to the 
mean. So far as the author knows there has not been published any investigation of this property 
except in the case of one type of curve where 

Y = Yo xP ee. 

Professor Karl Pearson, in his memoir on “Skew Variation in Homogeneous Material” (Phil. 
Trans. 1895), remarks on this property of the curve and he gives an expression for the ratio of the 
two distances in the form 

Co OOO Mea QOO4 ie cance sncsincsesasenceseeeeseatecuesataeeers (1). 


This was obtained from a few particular cases by the method of least squares. 


§2. Let y = yy (x) represent a frequency curve which has a maximum at a = 0 and which 
is zero ata = — a, and atxz=a,. Asarule there is only one maximum in frequency curves of 
this type. On these assumptions Laplace has given a method of approximating to the definite 
integral, the limits of integration lying between «= — a, and ~=a,. Following Laplace, we 
assume 

Yen CA (Co =n Came © seienctet seine deeasuisie e's seairetiese ste sialeajerine (2), 
where y, is the maximum ordinate, or the mode. Since 2 vanishes with t we may assume 


CIB Ee oben Dok? Faas accu seisaas sea sevee sce sebesilesatins (3). 


* Cf. on the other hand Pearson’s x? test. Let there be two series of observations with populations 
N,, N, and same relative frequency leading to S (e?/w) =x9?._ The two values of x? would be x;?=N, x¢", 
— 3X0 — 4x02 


standard deviations of form 1/./N, and 1/,/N,. 
Biometrika x1 28 
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Then, taken between appropriate limits, 
| yax = Jue 5 = - 0b = [e-# (By + 2Bot + 3Bgt + 2.) dé .ceeseeeeeee (4). 
This method is most useful where the range of t is either — © to +o or0to+o. The first case 
gives the “whole” area N; the second gives N,,, the area from the mode to the upper limit of «. 
Therefore 


+x = 
INF = | ye” (B+ 3B3t? + ~..) dt = ygN a (By + 2B, + .-.)) cc--00sessesesereeeeeeeene (5), 
Ny = 3N + is ye © (2Bot + 4By3 + ...)di=4N + yy (By + 2! Ba + 3! Bg + «..)....--(6). 
0) 
Hence the area from the mode to the median is 
Yo (Bo + 20 Bg 3! Be + 22.) seeo2-.-osscsosoeeesanseeeeeeeennt (7). 
The evaluation of B,, B,, ... we shall perform by means of the method of indeterminate coefficients, 
using (3) and dx db 
as == (0) Gon psopandecooancaradaascdoascaccx02002000 8 
Th de t UP HO verses, (8) 


which is obtained by logarithmic differentiation of (2) with respect to ¢. De Morgan* gives the 
values of the first five coefficients in terms of the differential coefficients at the mode of log ¢ (zx). 
It may be shown that when ¢ (2) contains high exponents or a large number of factors the 
coefficients B,, B,, B,,... are of decreasing magnitude and the series (5) and (7) are convergent 
enough for use. This is generally the case with frequency curves. When the skewness is small 
the coefficients B,, By, ... must be small; in this case, then, if d be the abscissa of the median we 
have the area from the mode to the median approximately equal to yd, and usually we may 
neglect B,, Bs, ... in comparison with B,. Hence approximately 


B= Boy oR seve sasiees saacenwe sens cece ce oeeeeee neers (9). 
If we let D be the distance of the mean from the mode we have 


a, +a 
Di x [aya = < i Yoe—" & (Wot + BO gf? + 20.) dbs. cecevceeescoees (10), 
where C’, is the coefficient of ¢” in the expansion of x? in powers of ¢. Then from (5) we have 
_ Yo jo iaq we 3B, B, ali 
D=N v7 GGs + --) = Bot. 7 ee (11). 
Therefore to a first approximation Di= BBy = BOX Faseics cod csosescesteonesseseecseeeceeeeene (12). - 


This is the relation that has been noticed in practical statistical work and it holds under the 
conditions stated. 


§3. The relation just found is quite general provided that the frequency curves are moderately 
asymmetrical. It would, of course, be possible to give general formulae for the more accurate 
representation of the relation between the distances of the mean and median from the mode in 
the very general type of curve already used. But in practice nearly all the types of frequency 
curves are included in Professor Pearson’s system and it is for these curves that we shall now 
obtain a closer approximation to the true value of d/D. In the memoir already quoted Professor 
Pearson takes the differential equation of frequency curves in the form 


l dy ay 
de maee sad ee (13), 
the origin of x being at the mode. 
If we write y = e—* we obtain, as in (8), 
di 22 
os se Atanet At(a, +042) One ee (14), 
and writing i ole OF A Ea OFS Hake Sennen REP ERA SACHA saakmEncunGooACO OCC (15), 


* Differential and Integral Calculus. p. 602. 


2 = — LY 
Bi? = — 2a jy eens 

B, = ! ( Ay” + aga ne 3 

ee (9°. 82 epee nue 
io eee LOSS Birr a oe 
B= - ( aaa Wa rt aut?) mn ae Oe 8 aa, 44 2 
Bi\540a, 30 12 ; Bo = 50h aie 945 a, 315 1 

ae 1 ( 139 a8 53 a,‘a, ‘ a7? — Agi" pean) 2 a" 2 aa, 2 a,3a,? 2. as 

A B, \340200a,2_ 12600 a, ° 4200 8 ® = 95515 a,) 2835 a,2 | 2835 a, | 63° 
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we have the following equations for the determination of B,, By, ... 


C, + 2a, = 0 C, = BY 
cae? >. oe neath A | Seal a (16) 
4C, + 4a,C', + 4a,B, = 0 C, = 2B,B; + B2 
5C; + 4a,0, + 4a,B, = 0 C; = 2B,B, + 2B,B, | 


eee e cece eceeeeeeee = = we eee eerrerereeree 


The solution of these equations gives the following values of B,, Bg, ... 


eee cece esr eeceec ee eseeeeee eee e ere reece evens eesecee 


The area from the mode to the median is then 


Pee 8 os Paps pace 4 16 1G» 
eens abe ue Obie? a ole) ae NI9Se SI 2 = O45? 


- 9 (58 a stor \+ 3 (sag + \4 I (18) 
Dict aie Peep inet Peas ceedec 5 


ag? (2835 * 945 % ap 


§4. Now this area can be found in another way in terms of d, the distance from the mode to 
the median, by assuming that ¢ (x) can be expanded in powers of x by Maclaurin’s Theorem. 
Professor Pearson* has shown that a high order parabola does not adequately fit these curves 
throughout the range, but if we assume that the skewness is small and therefore d small, we may 


use Maclaurin’s expansion legitimately within the range x = 0 tox =d. Since & is zero at the 
mode we have 
Pi (de ty Algae: AeA os te sea) ctelteisais'eneisonais vice Rewind vice (19), 
d 
and substituting in | ydx we have the required area equal to 
J0 
Yo (a+ & B+ fats = SE oe amas te ectssk Eaves secs (20) 


The values of the coefficients A,, Aj, ... may be obtained by using the differential equation (13) 
and the expansion (19); whence we obtain 
A, = (8 + 2) Agyo dy + (8 + 1) Ag. 1 a + SA, ay, 
1 a ay 
Fe ae ie aaa 


Thus the area required is 


and therefore A 


; {2 ri @G dia, n @(1- 2a.) da? da, | 

Yo 6a, 12a,2 40a,2 1 20a, 36a! are sof ee eececcececees 
This expression is in descending powers of a) and so is the one obtained in (18). Hence we may 
assume 


* Biometrika, Vol. 11. p. 19. 


oaeee 
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and substituting for d in (21) we may then equate to (18). 
to the approximation taken throughout the course of the work. 


4 


Sia 48 
= bs Jal (cee eet 
Cigna (a5 63 


§ 5. 


2, 88 
Doe tN sii 8 Ne s1OR 


Ay + 


NG 

= 91 Ao SP boo 
GS oe 

1575. cara 


Ay” 


ds 


d, = a5 ( 


ane ( 


We have assumed that the skewness is small; this is so if a, be small, and as the 


<eyie neti 


After considerable reduction, we have, 


64 
5103 14175 
21688 
3444525 


1184 


ay 


+...) 


...(23). 


To) 


coefficient of ay? is approximately a,7/170 it will be seen that we have a fairly rapidly convergent 


series for the value of d expressed in terms of the coefficients of the differential equation. 


We 


now proceed to transform the series so as to obtain the value of d/D in terms of the skewness. 


It is easily shown from the differential equation that 


a, = — D(1 + 2a,) 


files va see heute esheretemeeeeee (24), 
dy = — po (1 + 3ay) + D? (1+a,)) 
where py is the second moment coefficient about the mean. 
2 2 2 2 1p \ 72 
Hence cain weed ; ise 2a») F ( gee! : gt a sutig sabe close seRe ESRI (25) 
ao po 14 3a, po 1+ 8a, 
ldy_ 


Now the differential equation is obtained on the assumption that 


represented for all practical purposes by ay + a,x + aox?; this implies that a3, dy, ... 
complete expansion are negligibly small and usually a, is small also. 


2 


Be 


and that f (x) is 


y de f(e) 
of the 
If we restrict a, and 


es (the square of the skewness) to the cases where they are small we may expand by the binomial 


theorem all the quantities involved in (22), and after some reduction we obtain 


d 


2 8 
Da G ‘cae 


+ (soar5 - 


where (sk.) = skewness. 


16, 2328 ae i a 
ites ee =) of (ae 945 22 + 675 % +... } (sk.) 
184 9424 1328 

127572 pA + <———_ (SK.)B + one ceseceeeececneeeoeees 26), 
5515 127575 a.) (sh. aaaasog Ot) (26) 


The values of the coefficients of the successive powers of the skewness are given in the 


following table: 


(sk.)° | (sk.)? (sk.)4 

| “66666667 01975309 00721144 
| 53333333 — :06772486 — :07387027 
| —- +76190476 09481481 — 

| -91428571 | — — 


00038554 


(sk.)® 


| 
ol 
a 
| 

| 


= A, 

bs 
aaa } Ag” 
| ag 


§ 6. The types of curves for which the value of d/D is given by (26) are given below* with 


the values of ag: 


Type I. 
Type III. 


Type IV. 


Type V. 


Type VI. 


aw\in xv 
yow(reZ)"(1-% 
Ay a\ ya 
Y = Ye (1+ 2) 5 


y= — 


Y= You Pe” 


Y= Yo (@ -— alam, 


mes 
> 


1 
a, =——__.. 
2m, + My 
a, = 0. 
Aay=—- =. 
2 2m 
1 
@=--. 
Pp 
5 1 
2° Mm, — Mm, 


* These are not in all cases referred to the mode as the origin of «x. 
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It is very doubtful whether the series can be simplified any further and the question of con- 
vergency is a very difficult one, for no simple law can be given for any of the coefficients involved. 
The series has been tested with curves of Types I and III and the results are tabled below: 


| | d/D from d/D 
2 : 

Type 2 | (sk-) formula (correct) [Rb 
0-1 | +1733333 -715706 -715702 + :€00004 
I 0-1 | -4160000 -719087 -719048 + -000039 
0:05 | :2582456 -696072 -696072 -000000 
0-05 | -4657500 -700078 -700181 — -000103 

0 | 0-1 -66871448 | -66871460 — -00000012 

IU 0 | 0-2 “670909 -670909 -000000 
0 0-5 -678394 | +678347 + -000047 

0 | 1:0 -69402 | -69315 + :00087 


These show that the formula gives very good results even for high values of the skewness. 
It will be noted that in the case of Type III curves we have quite good results even when the 
skewness is unity. If the equation of Type III be taken in the form 


(2 a\P 
y=yoe *{ 1+ =| 
Yo D 


bell : : : 
then the square of the skewness is eel and when the skewness is unity pis zero. But the value 


of d as given by (22) and (23) is 


dg - ++ 3 -3+..., 
BT Te ey 
and this formula is of no great value if p be small. The processes of § 5 give 
d,’ do’ d,’ 


d=dy + 


p+” (p+iP” (t+ pt 
by what is practically Montfort’s Theorem, and the transformed series is simpler and of more 
value than the untransformed series. 

The formula given in § 3 for the area from the mode to the median has also been tested and 
gives results similar to those for the ratio of d to D. This may be useful if the area on either side 
of the mode be required. 


In conclusion I tender my thanks to Professor Pearson for his valuable criticism and heip. 


IV. The Probable Error of a Mendelian Class Frequency. 
EDITORIAL. 


Dr Raymond Pearl in a recent paper entitled “‘The Probable Error of a Mendelian Class 
Frequency*,” provides a striking illustration of how the capable biologist needs a long con- 
tinued training in the logic of mathematics before he ventures into the field of probability. 
Dr Pearl writes: ‘‘With the increased volume of Mendelian experimentation there is an ever- 
growing need for adequate and clearly understood tests for the statistical significance of differences 
between observed results and expectation.” Now there is absolutely no obscurity about the test 
for a single class frequency in a Mendelian result. I cannot conceive that there ever could be or 
ever has been such obscurity in the mind of a trained mathematician. If on the basis of Mendelian 
theory the relative frequency of that class be p and the relative frequency of all other classes 
be 1 - p = g, then in observation samples of m, the frequencies of this particular class distribute 
themselves according to the binomial law N (p + g)™ in N samples. To determine the impro- 


* American Naturalist, Vol. tt. p. 144, March, 1917. 
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bability of any observed result we have only to calculate the terms of this series. The mean 
of this frequency distribution is well known to be mp, and its standard deviation Vmpg. When 
neither p nor q is very small, then for practical purposes (when m of course is of a fair size, not 
a few units) it is quite adequate to calculate the probabilities of occurrence in such a single class 
frequency by the tables of the probability integral. The Mendelian worker who wishes to be more 
exact has merely to evaluate the terms of the binomial, or if m be too large, to approximate to 
those terms by the aid of my Type III curve, which falls for the cases excepted above much 
closer to the binomial than the normal curve does*. 

Two observations may here be made: First, if according to Mendelian theory there must be 
zero frequency in any particular class, then the improbability of any occurrence in that class is 
infinite. Secondly, the x? test can be applied to any number whatever of Mendelian classes, 
to s classes individually and all the remainder, or in a particular case to one class and all the 
remainder. When applied to one class and all the remainder, it reduces to exactly the process 
given above, i.e. to the application of the probability table on the basis of the standard deviation 
V mpq; or, referring to the genesis of the x2 test, which started by replacing the binomial by a 
normal curve, to using if we please the binomial. There is thus no contradiction between the two 
methods. This absence of contradiction was indicated by me in Biometrika, Vol. 1x. p. 312, 
and was perfectly familiar to Weldon, who applied the binomial test to cases where it is proper 
to apply it. Now let us examine Dr Pearl’s remarks in the light of these observations. He 
writes of Weldon’s and Johannsen’s use of the standard deviation of the binomial for entering 
the probability tablest: ‘‘In the first place it assumes the Gaussian distribution of errors, an 
assumption not often strictly warranted, as Pearson has clearly shown, and in many cases 
grossly in error.” Now this statement requires considerable modification, (i) it does not assume 
the Gaussian law of errors, but the fact that the Gaussian or better Laplacian integral 
for practical purposes gives adequately the frequencies of the binomial when extreme values 
of m, p and q are not included. Dr Pearl makes no attempt to show that Weldon (or Johannsen 
for the matter of that) was dealing with such extreme cases, but repeats the suggestion of his 
previous papert that Weldon somehow blundered. (ii) Further Dr Pearl cites for my view on 
the subject my paper “On the Influence of Past Experience on Future Expectation§.” In 
that paper, which deals as we shall show below with an entirely different matter, the standard 
deviation is not Vmpq, but the standard deviation characteristic of a second sample, so that 
criticism on this ground is out of place. Incidentally, however, I do recite in that paper the 
warning as to the use of the Laplacian integral when m, p, or g are extreme, and it was open 
to Dr Pearl to investigate whether Weldon had used the x? test in such cases. 

Let us look again at Dr Pearl. He writes: “The y? test gives a measure of the goodness 
of fit of the whole distribution, and only that. Now besides being interested in that point the 
Mendelian worker quite as often wants to know, in addition, something about the probability 
that particular classes observed are significantly different from the expected. To that sort of 
knowledge the x? test helps him not at all. It is an ‘all or none’ sort of method||.” 

Now Dr Pearl’s italics absolutely give him away, for they demonstrate that he does not in 
the least understand how to handle the x? test or what it means! Any test applied to s out 
of n classes must of course include the remaining » — s classes as a single group, for their total 
frequency is fixed, when the frequencies of the s classes are given. The y? test can be applied 
to a single class or to any number of classes up to n — 1, and it can only be a total misunder- 
standing of the test which can lead Dr Pearl to say that it is an “‘all or none” sort of method. 
Every test is an ‘“‘all or none” test, if Dr Pearl means that it involves the total number observed. 


* The area of Type III curve up to any deviation from the mean is given by the incomplete 
T-function. Tables of that function have been long in hand and are approaching completion and 
publication. Delay has arisen only from urgent war work. 

+ Loc. cit., p. 144. { Journal of Experimental Zoology, Vol. x11. p. 203 et seq. 

§ Phil. Mag., March, 1907, pp. 365-378. \| Loc. cit., p. 145. 


ih 
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Again let me cite Dr Pearl: “‘The x? test leads to this absurdity: if I perform a Mendelian 
experiment in which I get ten thousand million offspring agreeing perfectly with expectation save 
for one lone individual (perhaps a mutation, perhaps a mistake in the record, or what not) which 
is of a sort not expected, then Pearson and the x? test agree that the probability is infinitely 
great that the ten thousand million do not follow Mendelian law!*” 


6e 


Now this paragraph is I fear ‘‘characteristically” partizan, for in my criticism of Dr Pearl’s 
“paper I said nothing whatever about ten thousand millions obeying or not obeying the Mendelian 
law. Dr Pearl in his paper on the inheritance of fertility in hens got very frequently (from my 
standpoint) exceptions to his Mendelian theory. He drew in fact blue balls from a bag which 
his theory asserted contained only green or red balls. As a result he rejected the “goodness 
of fit” theory, which provided as it ought to do an infinite improbability. I asserted that he must 
either “‘remould his theory or explain away his observations}”; it was not a question of my test 
being fallacious. That he appears to be doing now when he states that exceptions may be 
“mutations” or “mistakes in records” or “what nots,” or as he did in his original paper when 
he remarked of such exceptions that they are of *‘a type which is continually arising in Mendelian 
work.” In other words he then emphasised and now again emphasises the “elasticity” of his 
observations. That is certainly the horn of the dilemma upon which I was prepared to drive 
him. But he is not content with such elasticity of Mendelism, or with badness of records or 
with “‘“what nots,” he seeks to show that a true test of Mendelian theory will be as plastic as 
the observations are elastic, in which case he will have successfully polled both horns. 


‘ ” 


In order to achieve this end he starts with a paper of mine published in the Philosophical 
Magazine for 1907, which he describes as ‘‘a very important paper, which is apparently almost 
entirely unknown to biologists.” While feeling grateful for the compliment, I regret that it 
should be brought to the notice of biologists in regard to a matter which it cannot possibly cover. 
Let me briefly state the purport of that paper. <A bag, a population, or a theoretical frequency 
is presented, the distribution law of which is absolutely unknown. The number in this funda- 
mental distribution is supposed indefinitely great or else individuals must be returned before 
each draw. A first sample of is drawn of which p present one characteristic and q do not. 
A second sample of m is now drawn, what is the chance that r individuals of this m will present 
the characteristic ? 


I showed that the mean of second samples would be 


p m q-p : 
me te eI ee ee (i), 
and the standard deviation o would be given by 
Peper (tee WEP Uh iat gk! Me Taso x 
o=m @ + CIE 7 ¢ Soe 5) (1 aie 3) snag saccisteisistslesins (11), 


the whole frequency distribution being provided by a hypergeometrical series. It is this hyper- 
geometrical series which Dr Pear] proposes to employ as a test of “goodness of fit” in Mendelian 
class frequencies. 


At first sight it seems almost impossible to determine how it can be applied, for the Men- 
delian Law which is supposed known is the theoretical frequency, the bag or original population, 
which my memoir supposes to be-wnknown except as suggested by the first sample. The only 
adequate method of applying the theory—and then it would be using a sledge hammer to crack 
nuts—would be to suppose the first sample indefinitely large. In this case p/n represents the 
class frequency of Mendelian theory p, and the ratio of the observed sample m to the total 
possible Mendelian population n is vanishingly small, and accordingly there results 

mean = mp, a? = mpq, 
and the hypergeometrical series collapses into N (p + q)”. Thus we fall back on the obvious 
binomial test with its infinite improbability for the cases of p = 0, which Dr Pearl finds not 


* Loc. cit., p. 145. + Biometrika, Vol. rx. p. 312. 
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‘plastic’? enough for his observations. But this, the logical application of the method, is not 
what Dr Pearl adopts although he has commenced his paper by emphasising the “‘ever-growing 
need for adequate and clearly understood tests for the statistical significance of differences 
between observed results and expectation*” and cited in this respect the y? and the Vmpq tests 
both of which are criteria of goodness of fit of observed results as a sample of the results to be 
expected from an indefinitely large and rigidly proportioned theoretical population. 


What Dr Pearl actually does had better be stated in his own words (italics his): ‘‘The pro- 
posal which I wish to make for the expression of a Mendelian result is that the expectation be 
expressed as the quartile limits of each class of frequency in a second sample of the same size as the 
observed samplet.” 

Now it is very difficult to understand this sentence, for we are not told what the “expectation” 
is. If it is the Mendelian expectation, then why is the Mendelian expectation the second sample 
and not the first sample? And further why should its size be the same as that of the observation 
and not indefinitely large? 

If we seek enlightenment in the working of the illustrations provided we can find it on 
pp. 150-151. Here to begin with we have the observations of F. L. Platt on the mating of Blue 
Andalusian fowls set out. There are 58 offspring of the mating Blue x Blue of which almost 
7% instead of being the white, blue or black of Mendelian theory are ‘“‘dark red.” These 
‘““mutations,’’ “‘errors of record” or “‘what nots” are thrown in with the blacks and classed as 
“pigmented not blue” (elastic observations!). Dr Pearl now assumes a first sample of 58 to be 
given in the actual Mendelian proportions of 1: 2: 1, ie., 14:5: 29: 14-5, and he then uses (i) and 
(ii) to obtain the mean = 14-9833 and o = 4-6364 for testing the actual data. 

Now what is the real significance of this procedure? Clearly he has made the Mendelian 
theory absolutely plastic. He has taken it not as the rigid law of the population sampled, but 
as the variable distribution of a very small first sample of 58 from a population of which the 
actual law is absolutely unknown or rather merely suggested by a very small sample. The smaller 
the observed series, the weaker will be the theory. The artifice is clear when it is clearly 
expressed, but no biologist with the average biologist’s knowledge of mathematics would realise 
what Dr Pearl has done and he would thus run the danger of accepting Dr Pearl’s test as a 
valid one. Suppose the sampled population to contain 1000 blue balls, 800 green balls, 500 red 
balls and 10 yellow balls. A sample of 50 would as a rule produce no yellow balls, say it pro- 
duced 22 blue, 17 green and 11 red. This would be used to represent the “Mendelian” ratio 
22:17:11 but later samples of 50 might occasionally produce yellow balls. These it is true 
would not appear in the ‘Mendelian theory” of 22:17:11, but would give no indefinite 
improbability, because the Mendelian theory has been shifted from being the rigid law of an 
indefinitely large population, to being the chance distribution of a sample, of just the same size 
as the observations themselves. Its accuracy or weight fluctuates with the observations themselves, 
and the “goodness of fit” test suggested by this process becomes no test whatever of the accord- 
ance of theory and observation, for the theory is made to have just the same weight as the 
observations, and the fewer the observed data, the greater ceteris paribus will be the likelihood 
of accordance between the two samples. 

Shortly Dr Pearl’s method is entirely fallacious, as any trained mathematician would have 
informed Dr Pearl had he sought advice before publication. It is most regrettable that such 
extensions of biometric theory should be lightly published, without any due sense of responsibility, 
not solely in biological but in psychological journals. It can only bring biometry into contempt 
as a science if, professing a mathematical foundation, it yet shows in its manifestations most 
inadequate mathematical reasoning. 


* Loc. cit., p. 144. The italics are mine. + Loc. cit., p. 148. 
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